terminus or 5') to end (C-terminus or 3'), such that for an alignment
that extends to p monomers (where p>x) there are p-x+1 such windows, each window has
at least x·y identical aligned monomers, where: x is selected from 20, 25, 30, 35, 40, 45, 50,
60, 70, 80, 90, 100, 150, 200; y is selected from 0.50, 0.60, 0.70, 0.75, 0.80, 0.85, 0.90,
0.91, 0.92, 0.93, 0.94, 0.95, 0.96, 0.97, 0.98, 0.99; and if x·y is not an integer then it is
rounded up to the nearest integer. The preferred pairwise alignment algorithm is the
Needleman-Wunsch global alignment algorithm [Needleman & Wunsch (1970) J. Mol.
Biol. 48, 443-453], using default parameters (e.g., with Gap opening penalty = 10.0, and
with Gap extension penalty = 0.5, using the EBLOSUM62 scoring matrix). This
algorithm is conveniently implemented in the needle tool in the EMBOSS package [Rice
et al. (2000) Trends Genet. 16:276-277].
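
The window criterion described above can be illustrated with a minimal sketch. It assumes the two sequences have already been globally aligned (for example with the needle tool) and are supplied as equal-length strings with "-" marking gaps. The function names `window_threshold` and `satisfies_window_criterion` are hypothetical and are not part of EMBOSS or of this specification.

```python
import math


def window_threshold(x: int, y: float) -> int:
    """Minimum number of identical aligned monomers per window: x*y, rounded up.

    The product is rounded to 9 decimals first so that floating-point noise
    (e.g. 16.999999999999996) does not change the result of the ceiling.
    """
    return math.ceil(round(x * y, 9))


def satisfies_window_criterion(aligned_a: str, aligned_b: str, x: int, y: float) -> bool:
    """Check every contiguous window of x aligned positions.

    aligned_a and aligned_b are two rows of a pairwise alignment (same length,
    '-' for gaps). For an alignment of p positions with p > x there are
    p - x + 1 windows; each must contain at least ceil(x*y) identical pairs.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    p = len(aligned_a)
    if p <= x:
        raise ValueError("the alignment must extend to more than x monomers")

    need = window_threshold(x, y)
    # 1 where the two rows carry the same monomer (a gap never counts as a match)
    matches = [int(a == b and a != "-") for a, b in zip(aligned_a, aligned_b)]

    count = sum(matches[:x])  # first window, start position 0
    if count < need:
        return False
    for start in range(1, p - x + 1):  # remaining p - x windows
        count += matches[start + x - 1] - matches[start - 1]
        if count < need:
            return False
    return True


if __name__ == "__main__":
    a = "A" * 25
    b = "A" * 20 + "C" * 5
    print(window_threshold(35, 0.93))                  # 32.55 is rounded up
    print(satisfies_window_criterion(a, b, 20, 0.75))  # every window has >= 15 matches
    print(satisfies_window_criterion(a, b, 20, 0.80))  # last window has only 15 of 16
```

The sliding count is updated in constant time per window, so the check is linear in the alignment length. Whether gap columns should count toward p is not stated in the text; here every alignment column counts, which is an assumption of this sketch.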

The nucleic acids and polypeptides of the invention may additionally have further sequences to
the N-terminus/5' and/or C-terminus/3' of these sequences (a) to (d).

All of the Gram positive bacterial sequences referenced herein are publicly available through 
PubMed on GenBank. 

Streptococcus pneumoniae Adhesin Island Sequences 

As discussed above, a S. pneumoniae AI sequence is present in the TIGR4 S. pneumoniae 
genome. Examples of S. pneumoniae AI sequences are set forth below.

SrtD (Sp0468) is a sortase. An example of an amino acid sequence of SrtD is set forth in 
SEQ ID NO: 80. 
SEQ ID NO: 80 

MSRTKLRALLGYLLMLVACLIPIYCFGQMVLQSLGQVKGHATFVKSMTTEMYQEQQNHSLAYNQRLASQNRIVDP 
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLGMGLAHVDGTPLPLDGTGIRSVIAGHRAEPSH 
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAV 
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLVVLAFLGILFVLWKLARLLRGK

SrtC (Sp0467) is a sortase. An example of an amino acid sequence of SrtC is set forth in SEQ 
ID NO: 81. 
SEQ ID NO: 81 

MSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAFNATLKPSEILDPFTEQEKKKGVSEYANMLKVHERIG 
YVEIPAIDQEIPMYVGTSEDILQKGAGLLEGASLPVGGENTHTVITAHRGLPTAELFSQLDKMKKGDIFYLHVLD 
QVLAYQVDQIVTVEPNDFEPVLIQHGEDYATLLTCTPYMINSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWL
LLGAMAVILLLLYRVYRNRRIVKGLEKQLEGRHVKD 

SrtB (SP0466) is a sortase. An example of an amino acid sequence of SrtB is set forth in 
SEQ ID NO: 82. 
SEQ ID NO: 82 

MAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDSLNNVVSGDPWSEEMKKKGRAEYARM 
LEIHERMGHVEIPVIDVDLPVYAGTAEEVLQQGAGHLEGTSLPIGGNSTHAVITAHTGLPTAKMFTDLTKLKVGD 
KFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGHDYVTLLTCTPYMINTHRLLVRGHRIPYVAEVEEEFIAANK 
LSHLYRYLFYVAVGLIVILLWIIRRLRKKKKQPEKALKALKAARKEVKVEDGQQ

Sp0465 is a hypothetical protein. An example of an amino acid sequence of Sp0465 is set
forth in SEQ ID NO: 83.
SEQ ID NO: 83

MFLPFLSASLYLQTHHFIAFPNRQSYLLRETRKSHFFLIHHPF

RrgC (SP0464) is a cell wall surface anchor family protein. RrgC contains a sortase substrate 
motif VPXTG (SEQ ID NO: 137), shown in italics in SEQ ID NO: 84.
SEQ ID NO: 84 

MISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDDRVQIVRDLHS 
WDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAKKTDTMTTK 
VKLIKVDQDHNRLEGVGFKLVSVARDVSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIFVTNLPLGNYRFKEVEP 
LAGYAVTTLDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEESGHYTPVLQNGKEVVV
TSGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGEETLYILML
VAILLFGSGYYLTKKPNN 

RrgB (Sp0463) is a cell wall surface anchor protein. RrgB contains a sortase substrate motif 
IPXTG (SEQ ID NO: 133), shown in italics in SEQ ID NO: 85.
SEQ ID NO: 85 

MKSINKFLTMLAALLLTASSLFSAATVFAAGTTTTSVTVHKLLATDGDMDKIANELETGNYAGNKVGVLPANAKE 
IAGVMFVWTNTNNEIIDENGQTLGVNIDPQTFKLSGAMPATAMKKLTEAEGAKFNTANLPAAKYKIYEIHSLSTY 
VGEDGATLTGSKAVPIEIELPLNDVVDAHVYPKNTEAKPKIDKDFKGKANPDTPRVDKDTPVNHQVGDVVEYEIV 
TKIPALANYATANWSDRMTEGLAFNKGTVKVTVDDVALEAGDYALTEVATGFDLKLTDAGLAKVNDQNAEKTVKI
TYSATLNDKAIVEVPESNDVTFNYGNNPDHGNTPKPNKPNENGDLTLTKTWVDATGAPIPAGAEATFDLVNAQTG 
KVVQTVTLTTDKNTVTVNGLDKNTEYKFVERSIKGYSADYQEITTAGEIAVKNWKDENPKPLDPTEPKVVTYGKK 
FVKVNDKDNRLAGAEFVIANADNAGQYLARKADKVSQEEKQLVVTTKDALDRAVAAYNALTAQQQTQQEKEKVDK 
AQAAYNAAVIAANNAFEWVADKDNENVVKLVSDAQGRFEITGLLAGTYYLEETKQPAGYALLTSRQKFEVTATSY
SATGQGIEYTAGSGKDDATKVVNKKITIPQTGGIGTIIFAVAGAAIMGIAVYAYVKNNKDEDQLA

RrgA (Sp0462) is a cell wall surface anchor protein. RrgA contains a sortase substrate motif 
YPXTG (SEQ ID NO: 186), indicated in italics in SEQ ID NO: 86. 
SEQ ID NO: 86 

MLNRETHMKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDG
TTVSQRTEAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGT 
YPDVQTPYQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTVYEQKDKSVPL 
DVVILLDNSNSMSNIRNKNARRAERAGEATRSLIDKITSDSENRVALVTYASTIFDGTEFTVEKGVADKNGKRLN 
DSLFWNYDQTSFTTNTKDYSYLKLTNDKNDIVELKNKVPTEAEDHDGNRLMYQFGATFTQKALMKADEILTQQAR 
QNSQKVIFHITDGVPTMSYPINFNHATFAPSYQNQLNAFFSKSPNKDGILLSDFITQATSGEHTIVRGDGQSYQM
FTDKTVYEKGAPAAFPVKPEKYSEMKAAGYAVIGDPINGGYIWLNWRESILAYPFNSNTAKITNHGDPTRWYYNG 
NIAPDGYDVFTVGIGINGDPGTDEATATSFMQSISSKPENYTNVTDTTKILEQLNRYFHTIVTEKKSIENGTITD 
PMGELIDLQLGTDGRFDPADYTLTANDGSRLENGQAVGGPQNDGGLLKNAKVLYDTTEKRIRVTGLYLGTDEKVT 
LTYNVRLNDEFVSNKFYDTNGRTTLHPKEVEQNTVRDFPIPKIRDVRKYPEITISKEKKLGDIEFIKVNKNDKKP 
LRGAVFSLQKQHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVN
GEVRDVTSIVPQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIGCMMMGGVLLYTRKHP
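
The sortase substrate motifs named above (VPXTG, IPXTG and YPXTG, where X stands for any amino acid) can be located in a sequence with a short pattern search. The following is a minimal sketch, not a method taken from this specification: the helper names `motif_regex` and `find_motif` are hypothetical, X is assumed to mean any of the 20 standard residues, and the three test strings are C-terminal fragments copied from cell wall surface protein sequences listed in this section.

```python
import re

STANDARD_RESIDUES = "ACDEFGHIKLMNPQRSTVWY"


def motif_regex(motif: str) -> "re.Pattern[str]":
    """Turn a motif such as 'VPXTG' into a regex; 'X' matches any standard residue."""
    parts = [f"[{STANDARD_RESIDUES}]" if c == "X" else re.escape(c) for c in motif]
    return re.compile("".join(parts))


def find_motif(sequence: str, motif: str) -> list:
    """Return (1-based start position, matched residues) for each non-overlapping hit."""
    return [(m.start() + 1, m.group()) for m in motif_regex(motif).finditer(sequence)]


if __name__ == "__main__":
    # C-terminal fragments taken from sequences in this section
    fragments = {
        "VPXTG": "KRPRIDVPDTGEETLYILMLVAILLFGSGYYLTKKPNN",
        "IPXTG": "VNKKITIPQTGGIGTIIFAVAGAAIMGIAVYAYVKNNKDEDQLA",
        "YPXTG": "ITNEPIPPKREYPRTGGIGMLPFYLIGCMMMGGVLLYTRKHP",
    }
    for motif, fragment in fragments.items():
        print(motif, find_motif(fragment, motif))
```

Positions are reported relative to the fragment passed in, so a full-length sequence should be supplied when coordinates in the complete protein are wanted.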

RlrA (Sp0461) is a transcriptional regulator. An example of an amino acid sequence for RlrA 
is set forth in SEQ ID NO: 87. 
SEQ ID NO: 87

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ 
TNYYFHQLYNQSTILKILRFFLLQGNQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF 
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK 
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF 
KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR
LYLFSLYLTETIFSSLPAIPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK 
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL 

As discussed above, a S. pneumoniae AI sequence is present in the S. pneumoniae strain 670
genome. Examples of S. pneumoniae AI sequences are set forth below.

An example of an amino acid sequence of orf1_670 is set forth in SEQ ID NO: 171.
SEQ ID NO: 171 

MEHINHTTLLIGIKDKNITLNKAIQHDTHIEVFATLDYHPPKCKHCKGKQIKYDFQKPSKIPFIEIGGFPSLIHL 
KKRRFQCKSCRKVTVAETTLVQKNCQISEMVRQKIAQLLLNREALTHIASKLAISTSTSTVYRKLKQFHFQEDYT 
TLPEILSWDEFSYQKGKLAFIAQDFNTKKIMTILDNRRQTTIRNHFFKYSKEARKKVKVVTVDMSGSYIPLIKKL
FPNAKIVLDRFHIVQHMSRALNQTRINIMKQFDDKSLEYRALKYYWKFILKDSRKLSLKPFYARTFRETLTPREC 
LKKIFTLVPELKDYYDLYQLLLFHLQEKNTDQFWGLIQDTLPHLNRTFKTTLSTFICYKNYITNAIELPYSNAKL 
EATNKLIKDIKRNAFGFRNFENFKKRIFIALNIKKERTKFVLSRA

Orf2_670 is a transcriptional regulator. An example of an amino acid sequence of Orf2_670 
is set forth in SEQ ID NO: 172. 
SEQ ID NO: 172 

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ
TNYYFHQLYNQSTILKILRFFLLQGNQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF 
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK 
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF 
KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR 
LYLFSLYLTETIFSSLPAIPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK 
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL

Orf3_670 is a cell wall surface anchor family protein. An example of an amino acid sequence
of Orf3_670 is set forth in SEQ ID NO: 173.
SEQ ID NO: 173 

MLNRETHMKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDG 
TTVSQRTEAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGT 
YPDVQTPYQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTTVETKEASTPL 
DVVILLDNSNSMSNIRHNHAHRAEKAGEATRALVDKITSNPDNRVALVTYGSTIFDGSEATVEKGVADANGKILN 
DSALWTFDRTTFTAKTYNYSFLNLTSDPTDIQTIKDRIPSDAEELNKDKLMYQFGATFTQKALMTADDILTKQAR
PNSKKVIFHITDGVPTMSYPINFKYTGTTQSYRTQLNNFKAKTPNSSGILLEDFVTWSADGEHKIVRGDGESYQM
FTKKPVTDQYGVHQILSITSMEQRAKLVSAGYRFYGTDLYLYWRDSILAYPFNSSTDWITNHGDPTTWYYNGNMA 
QDGYDVFTVGVGVNGDPGTDEATATRFMQSISSSPDNYTNVADPSQILQELNRYFYTIVNEKKSIENGTITDPMG 
ELIDFQLGADGRFDPADYTLTANDGSSLVNNVPTGGPQNDGGLLKNAKVFYDTTEKRIRVTGLYLGTGEKVTLTY 
NVRLNDQFVSNKFYDTNGRTTLHPKEVEKNTVRDFPIPKIRDVRKYPEITIPKEKKLGEIEFIKINKNDKKPLRD 
AVFSLQKQHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVNGEV 
RDVTSIVPQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIGCMMMGGVLLYTRKHP 

Orf4_670 is a cell wall surface anchor family protein. An example of an amino acid sequence 
of orf4_670 is set forth in SEQ ID NO: 174. 
SEQ ID NO: 174 

MKSINKFLTMLAALLLTASSLFSAATVFAADNVSTAPDAVTKTLTIHKLLLSEDDLKTWDTNGPKGYDGTQSSLK 
DLTGVVAEEIPNVYFELQKYNLTDGKEKENLKDDSKWTTVHGGLTTKDGLKIETSTLKGVYRIREDRTKTTYVGP 
NGQVLTGSKAVPALVTLPLVNNNGTVIDAHVFPKNSYNKPVVDKRIADTLNYNDQNGLSIGTKIPYVVNTTIPSN 
ATFATSFWSDEMTEGLTYNEDVTITLNNVAMDQADYEVTKGNNGFNLKLTEAGLAKINGKDADQKIQITYSATLN 
SLAVADIPESNDITYHYGNHQDHGNTPKPTKPNNGQITVTKTWDSQPAPEGVKATVQLVNAKTGEKVGAPVELSE 
NNWTYTWSGLDNSIEYKVEEEYNGYSAEYTVESKGKLGVKNWKDNNPAPINPEEPRVKTYGKKFVKVDQKDTRLE 
NAQFVVKKADSNKYIAFKSTAQQAADEKAAATAKQKLDAAVAAYTNAADKQAAQALVDQAQQEYNVAYKEAKFGY 
VEVAGKDEAMVLTSNTDGQFQISGLAAGTYKLEEIKAPEGFAKIDDVEFVVGAGSWNQGEFNYLKDVQKNDATKV 
VNKKITIPQTGGIGTIIFAVAGAAIMGIAVYAYVKNNKDEDQLA

Orf5_670 is a cell wall surface anchor family protein. An example of an amino acid sequence 
of orf5_670 is set forth in SEQ ID NO: 175. 
SEQ ID NO: 175 

MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDDRV 
QIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAK
YRFKEVEPLAGYAVTTLDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEESGHYTPVL
QNGKEVVVTSGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGE
ETLYILMLVAILLFGSGYYLTKKPNN 

Orf6_670 is a sortase. An example of an amino acid sequence of orf6_670 is set forth in SEQ 
ID NO: 176. 
SEQ ID NO: 176 

MLIKMVKTKKQKRNNLLLGVVFFIGMAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDS 
LNNVVSGDPWSEEMKKKGRAEYARMLEIHERMGHVEIPVIDVDLPVYAGTAEEVLQQGAGHLEGTSLPIGGNSTH
AVITAHTGLPTAKMFTDLTKLKVGDKFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGHDYVTLLTCTPYMINT 
HRLLVRGHRIPYVAEVEEEFIAANKLSHLYRYLFYVAVGLIVILLWIIRRLRKKKKQPEKALKALKAARKEVKVE
DGQQ 

Orf7_670 is a sortase. An example of an amino acid sequence of orf7_670 is set forth in SEQ
ID NO: 177.
SEQ ID NO: 177 

VSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAFNATLKPSEILDPFTEQEKKKGVSEYANMLKVHERIG 
YVEIPAIDQEIPMYVGTSEEILQKGAGLLEGASLPVGGENTHTVVTAHRGLPTAELFSQLDKMKKGDVFYLHVLD 
QVLAYQVDQILTVEPNDFEPVLIQHGEDYATLLTCTPYMINSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWL
LLAALVMILVLSYGVYRHRRIVKGLEKQLEEHHVKG 

Orf8_670 is a sortase. An example of an amino acid sequence of orf8_670 is set forth in SEQ
ID NO: 178.
SEQ ID NO: 178

MSKAKLQKLLGYLLMLVALVIPVYCFGQMVLQSLGQVKGHEIFSESVTADSYQEQLQRSLDYNQRLDSQNRIVDP 
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLAMGLAHVDGTPLPVEGKGIRSVIAGHRAEPSH 
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAV 
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLVVLAFLGILFVLWKLARLLRGK 

As discussed above, a S. pneumoniae AI sequence is present in the 19A Hungary 6 S.
pneumoniae genome. Examples of S. pneumoniae AI sequences from 19A Hungary 6 are set forth 
below. 

ORF2_19AH is a transcriptional regulator. An example of an amino acid sequence of 
ORF2_19AH is set forth in SEQ ID NO: 187.
SEQ ID NO: 187 

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ 
TNYYFHQLYNQSXILKILRFFLLQGNQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK 
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF
KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR
LYLFSLYLTETIFSSLPAIPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK 
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL 

ORF3_19AH is a cell wall surface protein. An example of an amino acid sequence of 
ORF3_19AH is set forth in SEQ ID NO: 188.
SEQ ID NO: 188 

MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDGTTVSQRT 
EAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTP 
YQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTTVETKEASTPLDVVILLD 
NSNSMSNIRHNHAHRAEKAGEATRALVDKITSNPDNRVALVTYGSTIFDGSEATVEKGVADANGKILNDSALWTF
DRTTFTAKTYNYSFLNLTSDPTDIQTIKDRIPSDAEELNKDKLMYQFGATFTQKALMTADDILTKQARPNSKKVI 
FHITDGVPTMSYPINFKYTGTTQSYRTQLNNFKAKTPNSSGILLEDFVTWSADGEHKIVRGDGESYQMFTKKPVT
TVGVGVNGDPGTDEATATRFMQSISSSPDNYTNVADPSQILQELNRYFYTIVNEKKSIENGTITDPMGELIDFQL
GADGRFDPADYTLTANDGSSLVNNVPTGGPQNDGGLLKNAKVFYDTTEKRIRVTGLYLGTGEKVTLTYNVRLNDQ 
FVSNKFYDTNGRTTLHPKEVEKNTVRDFPIPKIRDVRKYPEITIPKEKKLGEIEFIKINKNDKKPLRDAVFSLQK 
QHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVTSIV
PQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIGCMMMGGVLLYTRKNP 

ORF4_19AH is a cell wall surface protein. An example of an amino acid sequence of
ORF4_19AH is set forth in SEQ ID NO: 189.
SEQ ID NO: 189

MKSINKFLTMLAALLLTASSLFSAATVFAADNVSTAPDAVTKTLTIHKLLLSEDDLKTWDTNGPKGYDGTQSSLK 
DLTGVVAEEIPNVYFELQKYNLTDGKEKENLKDDSKWTTVHGGLTTKDGLKIETSTLKGVYRIREDRTKTTYVGP 
NGQVLTGSKAVPALVTLPLVNNNGTVIDAHVFPKNSYNKPVVDKRIADTLNYNDQNGLSIGTKIPYVVNTTIPSN 
ATFATSFWSDEMTEGLTYNEDVTITLNNVAMDQADYEVTKGXNGFNLKLTEAGLAKINGKDADQKIQITYSATLN 
SLAVADIPESNDITYHYGNHQDHGNTPKPTKPNNGQITVTKTWDSQPAPEGVKATVQLVNAKTGEKVGAPVELSE
NNWTYTWSGLDNSIEYKVEEEYNGYSAEYTVESKGKLGVKNWKDNNPAPINPEEPRVKTYGKKFVKVDQKDTRLE 
NAQFVVKKADSNKYIAFKSTAQQAADEKAAATAKQKLDAAVAAYTNAADKQAAQALVDQAQQEYNVAYKEAKFGY
VEVAGKDEAMVLTSNTDGQFQISGLAAGTYKLEEIKAPEGFAKIDDVEFVVGAGSWNQGEFNYLKDVQKNDATKV
VNKKITIPQTGGIGTIIFAVAGAAIMGIAVYAYVKNNKDEDQLA

ORF5_19AH is a cell wall surface protein. An example of an amino acid sequence of 
ORF5_19AH is set forth in SEQ ID NO: 190. 
SEQ ID NO: 190 

MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDDRV 
QIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAK
KTDTMTTKVKLIKVDQDHNRLEGVGFKLVSVARDGSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIFVTNLPLGN 
YRFKEVEPLAGYAVTTLDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEESGHYTPVL 
QNGKEVVVTSGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGE
ETLYILMLVAILLFGSGYYLTKKPNN 

ORF6_19AH is a putative sortase. An example of an amino acid sequence of ORF6_19AH is
set forth in SEQ ID NO: 191. 
SEQ ID NO: 191 

MLIKMVKTKKQKRNNLLLGVVFFIGMAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDS 
LNNVVSGDPWSEEMKKKGRAEYARMLEIHERMGHVEIPVIDVDLPVYAGTAEEVLQQGAGHLEGTSLPIGGNSTH
AVITAHTGLPTAKMFTDLTKLKVGDKFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGHDYVTLLTCTPYMINT 
HRLLVRGHRIPYVAEVEEEFIAANKLSHLYRYLFYVAVGLIVILLWIIRRLRKKKKQPEKALKALKAARKEVKVE 
DGQQ 

ORF7_19AH is a putative sortase. An example of an amino acid sequence of ORF7_19AH is
set forth in SEQ ID NO: 192.
SEQ ID NO: 192 

MDNSRRSRKKGTKKKKHPLILLLIFLVGFAVAIYPLVSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAF
NATLKPSEILDPFTEQEKKKGVSEYANMLKVHERIGYVEIPAIDQEIPMYVGTSEEILQKGAGLLEGASLPVGGE 
NTHTVVTAHRGLPTAELFSQLDKMKKGDVFYLHVLDQVLAYQVDQILTVEPNDFEPVLIQHGEDYATLLTCTPYM
INSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWLLLAALVMILVLSYGVYRHRRIVKGLEKQLEEHHVKG 

ORF8_19AH is a putative sortase. An example of an amino acid sequence of ORF8_19AH is 
set forth in SEQ ID NO: 193. 
SEQ ID NO: 193

MSKAKLQKLLGYLLMLVALVIPVYCFGQMVLQSLGQVKGHEIFSESVTADSYQEQLQRSLDYNQRLDSQNRIVDP 
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLAMGLAHVDGTPLPVEGKGIRSVIAGHRAEPSH 
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAV 
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLVVLAFMGILFVLWKLARLLRGK 

As discussed above, a S. pneumoniae AI sequence is present in the 6B Finland 12 S.
pneumoniae genome. Examples of S. pneumoniae AI sequences from 6B Finland 12 are set forth
below.

ORF2_6BF is a transcriptional regulator. An example of an amino acid sequence of
ORF2_6BF is set forth in SEQ ID NO: 194.
SEQ ID NO: 194 

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ 
TNYYFHQLYNQSTILKILRFFLLQGNQSFNEETQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF 
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK 
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF
KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR 
LYLFSLYLTETIFSSLPAIPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK 
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL

ORF3_6BF is a cell wall surface protein. An example of an amino acid sequence of
ORF3_6BF is set forth in SEQ ID NO: 195.
SEQ ID NO: 195 

MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDGTTVSQRT 
EAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTP
YQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTTVETKEASTPLDVVILLD
NSNSMSNIRHNHAHRAEKAGEATRALVDKITSNPDNRVALVTYGSTIFDGSEATVEKGVADANGKILNDSALWTF 
DRTTFTAKTYNYSFLNLTSDPTDIQTIKDRIPSDAEELNKDKLMYQFGATFTQKALMTADDILTKQARPNSKKVI 
FHITDGVPTMSYPINFKYTGTTQSYRTQLNNFKAKTPNSSGILLEDFVTWSADGEHKIVRGDGESYQMFTKKPVT
DQYGVHQILSITSMEQRAKLVSAGYRFYGTDLYLYWRDSILAYPFNSSTDWITNHGDPTTWYYNGNMAQDGYDVF 
TVGVGVNGDPGTDEATATRFMQSISSSPDNYTNVADPSQILQELNRYFYTIVNEKKSIENGTITDPMGELIDFQL
GADGRFDPADYTLTANDGSSLVNNVPTGGPQNDGGLLKNAKVFYDTTEKRIRVTGLYLGTGEKVTLTYNVRLNDQ 
FVSNKFYDTNGRTTLHPKEVEKNTVRDFPIPKIRDVRKYPEITIPKEKKLGEIEFIKINKNDKKPLRDAVFSLQK 
QHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVTSIV
PQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIGCMMMGGVLLYTRKHP 

ORF4_6BF is a cell wall surface protein. An example of an amino acid sequence of
ORF4_6BF is set forth in SEQ ID NO: 196.
SEQ ID NO: 196 

MKSINKFLTMLAALLLTASSLFSAATVFAADNVSTAPDAVTKTLTIHKLLLSEDDLKTWDTNGPKGYDGTQSSLK 
DLTGVVAEEIPNVYFELQKYNLTDGKEKENLKDDSKWTTVHGGLTTKDGLKIETSTLKGVYRIREDRTKTTYVGP
NGQVLTGSKAVPALVTLPLVNNNGTVIDAHVFPKNSYNKPVVDKRIADTLNYNDQNGLSIGTKIPYVVNTTIPSN
ATFATSFWSDEMTEGLTYNEDVTITLNNVAMDQADYEVTKGNNGFNLKLTEAGLAKINGKDADQKIQITYSATLN 
SLAVADIPESNDITYHYGNHQDHGNTPKPTKPNNGQITVTKTWDSQPAPEGVKATVQLVNAKTGEKVGAPVELSE 
NNWTYTWSGLDNSIEYKVEEEYNGYSAEYTVESKGKLGVKNWKDNNPAPINPEEPRVKTYGKKFVKVDQKDTRLE 
NAQFVVKKADSNKYIAFKSTAQQAADEKAAATAKQKLDAAVAAYTNAADKQAAQALVDQAQQEYNVAYKEAKFGY
VEVAGKDEAMVLTSNTDGQFQISGLAAGTYKLEEIKAPEGFAKIDDVEFVVGAGSWNQGEFNYLKDVQKNDATKV 
VNKKITIPQTGGIGTIIFAVAGAAIMGIAVYAYVKNNKDEDQLA 

ORF5_6BF is a cell wall surface protein. An example of an amino acid sequence of 
ORF5_6BF is set forth in SEQ ID NO: 197.
SEQ ID NO: 197 

MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDDRV
QIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAK
KTDTMTTKVKLIKVDQDHNRLEGVGFKLVSVARDGSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIFVTNLPLGN 
YRFKEVEPLAGYAVTTLDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEESGHYTPVL
QNGKEVVVTSGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGE
ETLYILMLVAILLFGSGYYLTKKPNN

ORF6_6BF is a putative sortase. An example of an amino acid sequence of ORF6_6BF is set
forth in SEQ ID NO: 198.
SEQ ID NO: 198 

MLIKMVKTKKQKRNNLLLGVVFFIGMAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDS
LNNVVSGDPWSEEMKKKGRAEYARMLEIHERMGHVEIPVIDVDLPVYAGTAEEVLQQGAGHLEGTSLPIGGNSTH
AVITAHTGLPTAKMFTDLTKLKVGDKFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGRDYVTLLTCTPYMINT 
HRLLVRGHRIPYVAEVEEEFIAANKLSHLYRYLFYVAVGLIVILLWIIRRLRKKKKQPEKALKALKAARKEVKVE 
DGQQ 

ORF7_6BF is a putative sortase. An example of an amino acid sequence of ORF7_6BF is set
forth in SEQ ID NO: 199.
SEQ ID NO: 199 

MDNSRRSRKKGTKKKKHPLILLLIFLVGFAVAIYPLVSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAF
NATLKPSEILDPFTEQEKKKGVSEYANMLKVHERIGYVEIPAIDQEIPMYVGTSEEILQKGAGLLEGASLPVGGE 
NTHTVVTAHRGLPTAELFSQLDKMKKGDVFYLHVLDQVLAYQVDQILTVEPNDFEPVLIQHGEDYATLLTCTPYM
INSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWLLLAALVMILVLSYGVYRHRRIVKGLEKQLEEHHVKG 

ORF8_6BF is a putative sortase. An example of an amino acid sequence of ORF8_6BF is set
forth in SEQ ID NO: 200.
SEQ ID NO: 200

MSKAKLQKLLGYLLMLVALVIPVYCFGQMVLQSLGQVKGHEIFSESVTADSYQEQLQRSLDYNQRLDSQNRIVDP 
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLAMGLAHVDGTPLPVEGKGIRSVIAGHRAEPSH 
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAV 
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLVVLAFLGILFVLWKLARLLRGK 

As discussed above, a S. pneumoniae AI sequence is present in the 6B Spain 2 S. pneumoniae 
genome. Examples of S. pneumoniae AI sequences from 6B Spain 2 are set forth below. 

ORF2_6BSP is a transcriptional regulator. An example of an amino acid sequence of 
ORF2_6BSP is set forth in SEQ ID NO: 201.
SEQ ID NO: 201

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ 
TNYYFHQLYNQSTILKILRFFLLQGNQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF 
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK 
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF
KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR
LYLFSLYLTETIFSSLPAIPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK 
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL

ORF3_6BSP is a cell wall surface protein. An example of an amino acid sequence of
ORF3_6BSP is set forth in SEQ ID NO: 202.
SEQ ID NO: 202 

MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDGTTVSQRT 
EAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTP
YQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTTVETKEASTPLDVVILLD 
NSNSMSNIRHNHAHRAEKAGEATRALVDKITSNPDNRVALVTYGSTIFDGSEATVEKGVADANGKILNDSALWTF
DRTTFTAKTYNYSFLNLTSDPTDIQTIKDRIPSDAEELNKDKLMYQFGATFTQKALMTADDILTKQARPNSKKVI 
FHITDGVPTMSYPINFKYTGTTQSYRTQLNNFKAKTPNSSGILLEDFVTWSADGEHKIVRGDGESYQMFTKKPVT
DQYGVHQILSITSMEQRAKLVSAGYRFYGTDLYLYWRDSILAYPFNSSTDWITNHGDPTTWYYNGNMAQDGYDVF 
TVGVGVNGDPGTDEATATRFMQSISSSPDNYTNVADPSQILQELNRYFYTIVNEKKSIENGTITDPMGELIDFQL 
GADGRFDPADYTLTANDGSSLVNNVPTGGPQNDGGLLKNAKVFYDTTEKRIRVTGLYLGTGEKVTLTYNVRLNDQ
FVSNKFYDTNGRTTLHPKEVEKNTVRDFPIPKIRDVRKYPEITIPKEKKLGEIEFIKINKNDKKPLRDAVFSLQK 
QHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVTSIV 
PQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIGCMMMGGVLLYTRKHP



WO 2006/078318 PCT/US2005/027239

ORF4_6BSP is a cell wall surface protein. An example of an amino acid sequence of
ORF4_6BSP is set forth in SEQ ID NO: 203.
SEQ ID NO: 203 

MKSINKFLTMLAALLLTASSLFSAATVFAADNVSTAPDAVTKTLTIHKLLLSEDDLKTWDTNGPKGYDGTQSSLK 
DLTGVVAEEIPNVYFELQKYNLTDGKEKENLKDDSKWTTVHGGLTTKDGLKIETSTLKGVYRIREDRTKTTYVGP
NGQVLTGSKAVPALVTLPLVNNNGTVIDAHVFPKNSYNKPVVDKRIADTLNYNDQNGLSIGTKIPYVVNTTIPSN 
ATFATSFWSDEMTEGLTYNEDVTITLNNVAMDQADYEVTKGNNGFNLKLTEAGLAKINGKDADQKIQITYSATLN 
SLAVADIPESNDITYHYGNHQDHGNTPKPTKPNNGQITVTKTWDSQPAPEGVKATVQLVNAKTGEKVGAPVELSE 
NNWTYTWSGLDNSIEYKVEEEYNGYSAEYTVESKGKLGVKNWKDNNPAPINPEEPRVKTYGKKFVKVDQKDTRLE 
NAQFVVKKADSNKYIAFKSTAQQAADEKAAATAKQKLDAAVAAYTNAADKQAAQALVDQAQQEYNVAYKEAKFGY 
VEVAGKDEAMVLTSNTDGQFQISGLAAGTYKLEEIKAPEGFAKIDDVEFVVGAGSWNQGEFNYLKDVQKNDATKV
VNKKITIPQTGGIGTIIFAVAGAAIMGIAVYAYVKNNKDEDQLA 

ORF5_6BSP is a cell wall surface protein. An example of an amino acid sequence of 
ORF5_6BSP is set forth in SEQ ID NO: 204.
SEQ ID NO: 204 

MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDDRV 
QIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAK
KTDTMTTKVKLIKVDQDHNRLEGVGFKLVSVARDGSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIFVTNLPLGN 
YRFKEVEPLAGYAVTTLDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEESGHYTPVL 
QNGKEVVVTSGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGE 
ETLYILMLVAILLFGSGYYLTKKPNN 

ORF6_6BSP is a putative sortase. An example of an amino acid sequence of ORF6_6BSP is
set forth in SEQ ID NO: 205. 
SEQ ID NO: 205 

MLIKMVKTKKQKRNNLLLGVVFFIGMAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDS
LNNVVSGDPWSEEMKKKGRAEYARMLEIHERMGHVEIPVIDVDLPVYAGTAEEVLQQGAGHLEGTSLPIGGNSTH 
AVITAHTGLPTAKMFTDLTKLKVGDKFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGHDYVTLLTCTPYMINT 
HRLLVRGHRIPYVAEVEEEFIAANKLSHLYRYLFYVAVGLIVILLWIIRRLRKKKKQPEKALKALKAARKEVKVE 
DGQQ 

ORF7_6BSP is a putative sortase. An example of an amino acid sequence of ORF7_6BSP is
set forth in SEQ ID NO: 206. 
SEQ ID NO: 206 

MDNSRRSRKKGTKKKKHPLILLLIFLVGFAVAIYPLVSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAF 
NATLKPSEILDPFTEQEKKKGVSEYANMLKVHERIGYVEIPAIDQEIPMYVGTSEEILQKGAGLLEGASLPVGGE 
NTHTVVTAHRGLPTAELFSQLDKMKKGDVFYLHVLDQVLAYQVDQILTVEPNDFEPVLIQHGEDYATLLTCTPYM 
INSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWLLLAALVMILVLSYGVYRHRRIVKGLEKQLEEHHVKG 

ORF8_6BSP is a putative sortase. An example of an amino acid sequence of ORF8_6BSP is
set forth in SEQ ID NO: 207. 
SEQ ID NO: 207 

MSKAKLQKLLGYLLMLVALVIPVYCFGQMVLQSLGQVKGHEIFSESVTADSYQEQLQRSLDYNQRLDSQNRIVDP 
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLAMGLAHVDGTPLPVEGKGIRSVIAGHRAEPSH 
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAV 
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLVVXAFLGILFVLWKLARLLRGK

As discussed above, a S. pneumoniae AI sequence is present in the 9V Spain 3 S. pneumoniae
genome. Examples of S. pneumoniae AI sequences from 9V Spain 3 are set forth below. 

ORF2_9VSP is a transcriptional regulator. An example of an amino acid sequence of
ORF2_9VSP is set forth in SEQ ID NO: 208.
SEQ ID NO: 208

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ
TNYYFHQLYNQSTILKILRFFLLQGNQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF 
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF
KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR 
LYLFSLYLTETIFSSLPATPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK 
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL 

ORF3_9VSP is a cell wall surface protein. An example of an amino acid sequence of
ORF3_9VSP is set forth in SEQ ID NO: 209.
SEQ ID NO: 209 

MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTNGTTVSQRT 
EAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQRTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTP 
YQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTVYERKDKSVPLDVVILLD
NSNSMSNIRNKNARRAERAGEATRSLIDKITSDPENRVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNY 
DQTSFTTNTKDYSYLKLTNDKNDIVELKNKVPTEAEDHDGNRLMYQFGATFTQKALMKADEILTQQARQNSQKVI 
FHITDGVPTMSYPINFNHATFAPSYQNQLNAFFSKSPNKDGILLSDFITQATSGEHTIVRGDGQSYQMFTDKTVY
EKGAPAAFPVKPEKYSEMKAVGYAVIGDPINGGYIWLNWRESILAYPFNSNTAKITNHGDPTRWYYNGNIAPDGY 
DVFTVGIGINGDPGTDEATATSFMQSISSKPENYTNVTDTTKILEQLNRYFHTIVTEKKSIENGTITDPMGELID
LQLGTDGRFDPADYTLTANDGSRLENGQAVGGPQNDGGLLKNAKVFYDTTEKRIRVTGLYLGTGEKVTLTYNVRL 
NDQFVSNKFYDTNGRTTLHPKEVEKNTVRDFPIPKIRDVRKYPAITIAKEKKLGEIEFIKINKNDKKPLRDAVFS 
LQKQHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVT 
SIVPQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLLFYLIGCMMMGGVLLYTRKHP 

ORF4_9VSP is a cell wall surface protein. An example of an amino acid sequence of
ORF4_9VSP is set forth in SEQ ID NO: 210.
SEQ ID NO: 210 

MKSINKFLTMLAALLLTASSLFSAATVFAAGTTTTSVTVHKLLATDGDMDKIANELETGNYAGNKVGVLPANAKE
IAGVMFVWTNTNNEIIDENGQTLGVNIDPQTFKLSGAMPATAMKKLTEAEGAKFNTANLPAAKYKIYEIHSLSTY
VGEDGATLTGSKAVPIEIELPLNDVVDAHVYPKNTEAKPKIDKDFKGKANPDTPRVDKDTPVNHQVGDVVEYEIV 
TKIPALANYATANWSDRMTEGLAFNKGTVKVTVDDVALEAGDYALTEVATGFDLKLTDAGLAKVNDQWAEKTVKI 
TYSATLNDKAIVEVPESNDVTFNYGNNPDHGNTPKPNKPNENGDLTLTKTWVDATGAPIPAGAEATFDLVNAQTG
KVVQTVTLTTDKNTVTVNGLDKNTEYKFVERSIKGYSADYQEITTAGEIAVKNWKDENPKPLDPTEPKVVTYGKK 
FVKVNDKDNRLAGAEFVIANADNAGQYLARKADKVSQEEKQLVVTTKDALDRAVAAYNALTAQQQTQQEKEKVDK
AQAAYNAAVIAANNAFEWVADKDNENVVKLVSDAQGRFEITGLLAGTYYLEETKQPAGYALLTSRQKFEVTATSY 
SATGQGIEYTAGSGKDDATKVVNKKITIPQTGGIGTIIFAVAGAVIMGIAVYAYVKNNKDEDQLA 

ORF5_9VSP is a cell wall surface protein. An example of an amino acid sequence of
ORF5_9VSP is set forth in SEQ ID NO: 211.
SEQ ID NO: 211 

MTMQKMQKMQKMQKMQKMQKMIS 

KLDDSYSYDNRVQIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMT 
DQTVEPLVIVAKKADTVTTKVKLIKVDQDHNRLEGVGFKLVSVARDGSEKEVPLIGEYRYSSSGQVGRTLYTDKN 
GEIVVTNLPLGTYRFKEVEPLAGYTVTTMDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKV
MKEENGHYTPVLQNGKEVVVASGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNN 
KRPRIDVPDTGEETLYILMLVAILLFGSGYYLTKKTNN 

ORF6_9VSP is a putative sortase. An example of an amino acid sequence of ORF6_9VSP is
set forth in SEQ ID NO: 212.
SEQ ID NO: 212 

MLIKMAKTKKQKRNNLLLGVVFFIGIAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDS 
LNNVVSGDPWSEEMKKKGRAEYARMLEIHERMGHVEIPAIDVDLPVYAGTAEEVLQQGAGHLEGTSLPIGGNSTH 
AVITAHTGLPTAKMFTDLTKLKVGDKFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGHDYVTLLTCTPYMINT 
HRLLVRGHRIPYVAEVEEEFIAANKLSHLYRYLFYVAVGLIVILLWIIRRLRKKKRQSERALKALKEATKEVKVE
DE

ORF7_9VSP is a putative sortase. An example of an amino acid sequence of ORF7_9VSP is
set forth in SEQ ID NO: 213.
SEQ ID NO: 213 

MSKSRYSRKKSVKKKKNPFILLLIFLVGLAVAMYPLVSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAF
NATLKPSEILDPFTEQEKKKGVSEYANMLKVHERIGYVEIPAIDQEIPMYVGTSEEILQKGAGLLEGASLPVGGE 
NTHTVVTAHRGLPTAELFSQLDKMKKGDIFYLHVLDQVLAYQVDQIVTVEPNDFEPVLIQHGEDYATLLTCTPYM
INSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWLLLGAMAVILLLLYRVYRNRRIVKGLEKQLEGRHVKD 

ORF8_9VSP is a putative sortase. An example of an amino acid sequence of ORF8_9VSP is
set forth in SEQ ID NO: 214.
SEQ ID NO: 214 

MSRTKLRALLGYLLMLVACLIPIYCFGQMVLQSLGQVKGHATFVKSMTTEMYQEQQNHSLAYNQRLASQNRIVDP
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLGMGLAHVDGTPLPLDGTGIRSVIAGHRAEPSH 
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNTMTLITCDPIPTFNKRLLVNFERVAV
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLVVLAFLGILFVLWKLARLLRGK

As discussed above, a S. pneumoniae AI sequence is present in the 14 CSR 10 S. pneumoniae
genome. Examples of S. pneumoniae AI sequences from 14 CSR 10 are set forth below.

ORF2_14CSR is a transcriptional regulator. An example of an amino acid sequence of
ORF2_14CSR is set forth in SEQ ID NO: 215.
SEQ ID NO: 215 

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ 
TNYYFHQLYNQSTILKILRFFLLQGNQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF 
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF 
KNILGNDISNSLSFLTALXFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR 
LYLFSLYLTETIFSSLPAIPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK 
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL 

ORF3_14CSR is a cell wall surface protein. An example of an amino acid sequence of
ORF3_14CSR is set forth in SEQ ID NO: 216.
SEQ ID NO: 216 

MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDGTTVSQRT 
EAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTP
YQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTTVETKEASTPLDVVILLD 
NSNSMSNIRHNHAHRAEKAGEATRALVDKITSNPDNRVALVTYGSTIFDGSEATVEKGVADANGKILNDSALWTF 
DRTTFTAKTYNYSFLNLTSDPTDIQTIKDRIPSDAEELNKDKLMYQFGATFTQKALMTADDILTKQARPNSKKVI 
FHITDGVPTMSYPINFKYTGTTQSYRTQLNNFKAKTPNSSGILLEDFVTWSADGEHKIVRGDGESYQMFTKKPVT
DQYGVHQILSITSMEQRAKLVSAGYRFYGTDLYLYWRDSILAYPFNSSTDWITNHGDPTTWYYNGNMAQDGYDVF
TVGVGVNGDPGTDEATATRFMQSISSSPDNYTNVADPSQILQELNRYFYTIVNEKKSIENGTITDPMGELIDFQL 
GADGRFDPADYTLTANDGSSLVNNVPTGGPQNDGGLLKNAKVFYDTTEKRIRVTGLYLGTGEKVTLTYNVRLNDQ 
FVSNKFYDTNGRTTLHPKEVEKNTVRDFPIPKIRDVRKYPEITIPKEKKLGEIEFIKINKNDKKPLRDAVFSLQK 
QHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVTSIV
PQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIGCMMMGGVLLYTRKHP

ORF4_14CSR is a cell wall surface protein. An example of an amino acid sequence of 
ORF4_14CSR is set forth in SEQ ID NO: 217. 
SEQ ID NO: 217 

MKSINKFLTMLAALLLTASSLFSAATVFAADNVSTAPDAVTKTLTIHKLLLSEDDLKTWDTNGPKGYDGTQSSLK
DLTGVVAEEIPNVYFELQKYNLTDGKEKENLKDDSKWTTVHGGLTTKDGLKIETSTLKGVYRIREDRTKTTYVGP 
NGQVLTGSKAVPALVTLPLVNNNGTVIDAHVFPKNSYNKPVVDKRIADTLNYNDQNGLSIGTKIPYVVNTTIPSN 
ATFATSFWSDEMTEGLTYNEDVTITLNNVAMDQADYEVTKGNNGFNLKLTEAGLAKINGKDADQKIQITYSATLN 
SLAVADIPESNDITYHYGNHQDHGNTPKPTKPNNGQITVTKTWDSQPAPEGVKATVQLVNAKTGEKVGAPVELSE
NAQFWKKADSNKYIAFKSTAQQAADEKAAATAKQKLDAAVAAYTNAADKQAAQALVDQAQQEYNVAYKEAKFGY
VEVAGKDEAMVLTSNTDGQFQISGLAAGTYKLEEIPCAPEGFAKIDDVEFVVGAGSWNQGEFNYLKDVQKNDATKV 
VNKKITIPQTGGIGTIIFAVAGAAIMGIAVYAYVKNNKDEDQLA

ORF5_14CSR is a cell wall surface protein. An example of an amino acid sequence of
ORF5_14CSR is set forth in SEQ ID NO: 218. 
SEQ ID NO: 218 

MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDDRV 
QIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAK
KTDTMTTKVKLIKVDQDHNRLEGVGFKLVSVARDGSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIFVTNLPLGN 
YRFKEVEPLAGYAVTTLDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEESGHYTPVL 
QNGKEVVVTSGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTXGKDTRKELVTVVKNNKRPRIDVPDTGE 
ETLYILMLVAILLFGSGYYLTKKPNN 

ORF6_14CSR is a putative sortase. An example of an amino acid sequence of ORF6_14CSR
is set forth in SEQ ID NO: 219. 
SEQ ID NO: 219 

MLIKMVT^TKKQKRNNLLLGVVFFIGMAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDS
LNNVVSGDPWSEEMKKKGRAEYARMLEIHERMGHVEIPVIDVDLPVYAGTAEEVLQQGAGHLEGTSLPIGGNSTH
AVITAHTGLPTAKMFTDLTKLKVGDKFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGHDYVTLLTCTPYMINT
HRLLVRGHRIPYVAEVEEEFIAANKLSHLYRYLFYVAVGLIVILLWIIRRLRKKKKQPEKALKALKAARKEVKVE
DGQQ 

ORF7_14CSR is a putative sortase. An example of an amino acid sequence of ORF7_14CSR
is set forth in SEQ ID NO: 220.
SEQ ID NO: 220 

MDNSRRSRKKGTKKKKHPLILLLIFLVGFAVAIYPLVSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAF 
NATLKPSEILDPFTEQEKKKGVSEYANMLKVHERIGYVEIPAIDQEIPMYVGTSEEILQKGAGLLEGASLPVGGE 
NTHTVVTAHRGLPTAELFSQLDKMKKGDVFYLHVLDQVLAYQVDQILTVEPNDFEPVLIQHGEDYATLLTCTPYM
INSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWLLLAALVMILVLSYGVYRHRRIVKGLEKQLEEHHVKG 

ORF8_14CSR is a putative sortase. An example of an amino acid sequence of ORF8_14CSR
is set forth in SEQ ID NO: 221.
SEQ ID NO: 221

MSKAKLQKLLGYLLMLVALVIPVYCFGQMVLQSLGQVKGHEIFSESVTADSYQEQLQRSLDYNQRLDSQNRIVDP 
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLAMGLAHVDGTPLPVEGKGIRSVIAGHRAEPSH 
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAV 
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLVVLAFLGILFVLWKLARLLRGK

As discussed above, a S. pneumoniae AI sequence is present in the 19F Taiwan 14 S. 
pneumoniae genome. Examples of S. pneumoniae AI sequences from 19F Taiwan 14 are set forth 
below. 

ORF2_19FTW is a transcriptional regulator. An example of an amino acid sequence of
ORF2_19FTW is set forth in SEQ ID NO: 222.
SEQ ID NO: 222 

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ 
TNYYFHQLYNQSTILKILRFFLLQGNQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF 
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK 
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF
KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR 
LYLFSLYLTETIFSSLPAIPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK 
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL

ORF3_19FTW is a cell wall surface protein. An example of an amino acid sequence of
ORF3_19FTW is set forth in SEQ ID NO: 223. 
SEQ ID NO: 223 

MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDGTTVSQRT
EAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTP 
YQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTVYERKDKSVPLDVVILLD
NSNSMSNIRNKNARRAERAGEATRSLIDKITSDPENRVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNY 
DQTSFTTNTKDYSYLKLTNDKNDIVELKNKVPTEAEDHDGNRLMYQFGATFTQKALMKADEILTQQARQNSQKVI
FHITDGVPTMSYPINFNHATFAPSYQNQLNAFFSKSPNKDGILLSDFITQATSGEHTIVRGDGQSYQMFTDKTVY
EKGAPAAFPVKPEKYSEMKAVGYAVIGDPINGGYIWLNWRESILAYPFNSNTAKITNHGAPTRWYYNGNIAPDGY 
DVFTVGIGINGDPGTDEATATSFMQSISSKPENYTNVTDTTKILEQLNRYFHTIVTEKKSIENGTITDPMGELID 
LQLGTDGRFDPADYTLTANDGSRLENGQAVGGPQNDGGLLKNAKVFYDTTEKRIRVTGLYLGTGEKVTLTYNVRL 
NDQFVSNKFYDTNGRTTLHPKEVEKNTVRDFPIPKIRDVRKYPAITIAKEKKLGEIEFIKINKNDKKPLRDAVFS
LQKQHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVT
SIVPQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIGCMMMGGVLLYTRKHP 

ORF4_19FTW is a cell wall surface protein. An example of an amino acid sequence of 
ORF4_19FTW is set forth in SEQ ID NO: 224.
SEQ ID NO: 224

MKSINKFLTMLAALLLTASSLFSAATVFAAGTTTTSVTVHKLLATDGDMDKIANELETGNYAGNKVGVLPANAKE
IAGVMFVWTNTNNEIIDENGQTLGVNIDPQTFKLSGAMPATAMKKLTEAEGAKFNTANLPAAKYKIYEIHSLSTY 
VGEDGATLTGSKAVPIEIELPLNDVVDAHVYPKNTEAKPKIDKDFKGKANPDTPRVDKDTPVNHQVGDVVEYEIV 
TKIPALANYATANWSDRMTEGLAFNKGTVKVTVDDVALEAGDYALTEVATGFDLKLTDAGLAKVNDQNAEKTVKI 
TYSATLNDKAIVEVPESNDVTFNYGNNPDHGNTPKPNKPNENGDLTLTKTWVDATGAPIPAGAEATFDLVNAQTG
KVVQTVTIiTTDKNTVTVNGLDKNTEYKFVERSIKGYSADYQE I.TTAGEIAVKNWKDENPKPLDPTEPPCWTYGKK 
FVKVNDKDNRLAGAEFVIANADNAGQYLARKADKVSQEEKQLVVTTKDALDRAVAAYNALTAQQQTQQEKEKVDK 
AQAAYNAAVIAANNAFEWVADKDNENVVKLVSDAQGRFEITGLLAGTYYLEETKQPAGYALLTSRQKFEVTATSY 
SATGQGIEYTAGSGKDDATKVVNKKITIPQTGGIGTIIFAVAGAVIMGIAVYAYVKNNKDEDQLA 

ORF5_19FTW is a cell wall surface protein. An example of an amino acid sequence of 
ORF5_19FTW is set forth in SEQ ID NO: 225. 
SEQ ID NO: 225 

MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDNRV 
QIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAK
KADTVTTKVKLIKVDQDHNRLEGVGFKLVSVARDGSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIVVTNLPLGT 
YRFKEVEPLAGYTVTTMDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEENGHYTPVL 
QNGKEVVVASGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGE 
ETLYILMLVAILLFGSGYYLTKKTNN 

ORF6_19FTW is a putative sortase. An example of an amino acid sequence of 
ORF6_19FTW is set forth in SEQ ID NO: 226. 
SEQ ID NO: 226 

MLIKMAKTKKQKRNNLLLGVVFFIGMAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDS
LNNVVSGDPWSEEMKKKGRAEYARMLEIHERMGHVEIPAIDVDLPVYAGTAEEVLQQGAGHLEGTSLPIGGNSTH
AVITAHTGLPTAKMFTDLTKLKVGDKFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGHDYVTLLTCTPYMINT 
HRLLVRGHRIPYVAEVEEEFIAANKLSHLYRYLFYVAVGLIVILLWIIRRLRKKKRQSERALKALKEATKEVKVE 
DE 

ORF7_19FTW is a putative sortase. An example of an amino acid sequence of
ORF7_19FTW is set forth in SEQ ID NO: 227.
SEQ ID NO: 227 

MSKSRYSRKKSVKKKKNPFILLLIFLVGLAVAMYPLVSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAF 
NATLKPSEILDPFTDQEKKQGVSEYANMLKVHERIGYVEIPAIEQEIPMYVGTSEDILQKGAGLLEGASLPVGGE
INSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWLLLGAMAVILLLLYRVYRNRRIVKGLEKQLEGRHVKD

ORF8_19FTW is a putative sortase. An example of an amino acid sequence of 
ORF8_19FTW is set forth in SEQ ID NO: 228.
SEQ ID NO: 228 

MSRTKLRALLGYLLMLVACLIPIYCFGQMVLQSLGQVKGHATFVKSMTTEMYQEQQNHSLAYNQRLASQNRIVDP 
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLGMGLAHVDGTPLPLDGTGIRSVIAGHRAEPSH 
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAV 
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLVVLAFLGILFVLWKLARLLRGK

As discussed above, a S. pneumoniae AI sequence is present in the 23F Taiwan 15 S. 
pneumoniae genome. Examples of S. pneumoniae AI sequences from 23F Taiwan 15 are set forth 
below. 

ORF2_23FTW is a transcriptional regulator. An example of an amino acid sequence of
ORF2_23FTW is set forth in SEQ ID NO: 229.
SEQ ID NO: 229 

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ 
TNYYFHQLYNQSTILKILRFFLLQGNQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF 
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF 
KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR 
LYLFSLYLTETIFSSLPAIPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK 
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL 

ORF3_23FTW is a cell wall surface protein. An example of an amino acid sequence of 
ORF3_23FTW is set forth in SEQ ID NO: 230. 
SEQ ID NO: 230 

MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDGXTVSQRT 
EAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTP
YQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTVYEQKDKSVPLDVVILLD 
NSNSMSNIRNKNARRAERAGEATRSLIDKITSDPENRVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNY 
DQTSFTTNTKDYSYLKLTNDKNDIVELKNKVPTEAEDHDGNRLMYQFGATFTQKALMKADEILTQQARQNSQKVI 
FHITDGVPTMSYPINFNHATFAPSYQNQLNAFFSKSPNKDGILLSDFITQATSGEHTIVRGDGQSYQMFTDKTVY
EKGAPAAFPVKPEKYSEMKAAGYAVIGDPINGGYIWLNWRESILAYPFNSNTAKITNHGDPTRWYYNGNIAPDGY
DVFTVGIGINGDPGTDEATATSFMQSISSKPENYTNVTDTTKILEQLNRYFHTIVTEKKSIENGTITDPMGELID 
LQLGTDGRFDPADYTLTANDGSRLENGQAVGGPQNDGGLLKNAKVLYDTTEKRIRVTGLYLGTDEKVTLTYNVRL 
NDEFVSNKFYDTNGRTTLHPKEVEQNTVRDFPIPKIRDVRKYPEITISKEKKLGDIEFIKVNKNDKKPLRDAVFS 
LQKQHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVT 
SIVPQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIGCMMMGGVLLYTRKHP

ORF4_23FTW is a cell wall surface protein. An example of an amino acid sequence of 
ORF4_23FTW is set forth in SEQ ID NO: 231. 
SEQ ID NO: 231 

MKSINKFLTILAALLLTVSSLFSAATVFAAEQKTKTLTVHKLLMTDQELDAWNSDAITTAGYDGSQNFEQFKQLQ
GVPQGVTEISGVAFELQSYTGPQGKEQENLTNDAVWTAVNKGVTTETGVKFDTEVLQGTYRLVEVRKESTYVGPN 
GKVLTGMKAVPALITLPLVNQNGVVENAHVYPKNSEDKPTATKTFDTAAGFVDPGEKGLAIGTKVPYIVTTTIPK 
NSTLATAFWSDEMTEGLDYNGDVVVNYNGQPLDNSHYTLEAGHNGFILKLNEKGLEAINGKDAEATITLKYTATL 
NALAVADVPEANDVTFHYGNNPGHGNTPKPNKPKNGELTITKTWADAKDAPIAGVEVTFDLVNAQTGEVVKVPGH
ETGIVLNQTNNWTFTATGLDNNTEYKFVERTIKGYSADYQTITETGKIAVKNWKDENPEPINPEEPRVKTYGKKF
VKVDQKDERLKEAQFVVKNEQGKYLALKSAAQQAVNEKAAAEAKQALDT^AIAAYTNAADKNAAQAVVDAAQKTYN 
DNYRAARFGYVEVERKEDALVLTSNTDGQFQISGLAAGSYTLEETKAPEGFAKLGDVKFEVGAGSWNQGDFNYLK 
DVQKNDATKVVNKKITIPQTGGIGTIIFAVAGAVIMGIAVYAYVKNNKDEDQLA

ORF5_23FTW is a cell wall surface protein. An example of an amino acid sequence of
ORF5_23FTW is set forth in SEQ ID NO: 232. 
SEQ ID NO: 232 

MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDNRV 
QIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAK
KADTVTTKVKLIKVDQDHNRLEGVGFKLVSVARDGSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIVVTNLPLGT 
YRFKEVEPLAGYTVTTMDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEENGHYTPVL 
QNGKEVVVASGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGE 
ETLYILMLVAILLFGSGYYLTKKTNN 

ORF6_23FTW is a putative sortase. An example of an amino acid sequence of 
ORF6_23FTW is set forth in SEQ ID NO: 233.
SEQ ID NO: 233 

MLIECMVKTKKQKRNNLLLGVVFFIGMAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDS 
LNNVVSGDPWSEEMKKKGRAEYARMLEIHERMGHVEIPVIDVDLPVYAGTAEEVLQQGAGQLEGTSLPIGGNSTH
AVITAHTGLPTAKMFTDLTKLKVGDKFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGHDYVTLLTCTPYMINT 
HRLLVRGHRIPYVAEVEEEFIAANKLSHLYRYLFYVAVGLIVILLWIIRRLRKKKKQPEKALKALKAARKEVKVE 
DGQQ 

ORF7_23FTW is a putative sortase. An example of an amino acid sequence of
ORF7_23FTW is set forth in SEQ ID NO: 234.
SEQ ID NO: 234 

MDNSRRSRKKGTKKKKHPLILLLIFLVGFAVAIYPLVSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAF 
NATLKPSEILDPFTEQEKKKGVSEYANMLKVHERIGYVEIPAIDQEIPMYVGTSEEILQKGAGLLEGASLPVGGE 
NTHTVVTAHRGLPTAELFSQLDKMKKGDVFYLHVLDQVLAYQVDQILTVEPNDFEPVLIQHGKDYATLLTCTPYM
INSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWLLLAALVMILVLSYGVYRHRRIVKGLEKQLEEHHVKG 

ORF8_23FTW is a putative sortase. An example of an amino acid sequence of 
ORF8_23FTW is set forth in SEQ ID NO: 235. 
SEQ ID NO: 235

MSKAKLQKLLGYLLMLVALVIPVYCFGQMVLQSLGQVKGHEIFSESVTADSYQEQLQRSLDYNQRLDSQNRIVDP 
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLAMGLAHVDGTPLPVEGKGIRSVIAGHRAEPSH 
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAV
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLVVLAFLGILFVLWKLARLLRGK 

As discussed above, a S. pneumoniae AI sequence is present in the 23F Poland 16 S. 
pneumoniae genome. Examples of S. pneumoniae AI sequences from 23F Poland 16 are set forth 
below. 

ORF2_23FP is a transcriptional regulator. An example of an amino acid sequence of 
ORF2_23FP is set forth in SEQ ID NO: 236.
SEQ ID NO: 236 

MLNKYIEKRITDKITILNILLDIRSIELDELSTLTSLQSKSLLSILQELQETFEEELTFNLDTQQVQLIEHHSHQ 
TNYYFHQLYNQSTILKILRFFLLQGNQSFNEFTQKEYISIATGYRVRQKCGLLLRSVGLDLVKNQVVGPEYRIRF 
LIALLQFHFGIEIYDLNDGSMDWVTHMIVQSNSQLSHELLEITPDEYVHFSILVALTWKRREFPLEFPESKEFEK
LKNLFMYPILMEHCQTYLEPHANMTFTQEELDYIFLVYCSANSSFSKDKWNQEKKTHTIQLILQHTRGKHLLSKF
KNILGNDISNSLSFLTALTFLTRTFLFGLQNLVPYYNYYEHYGIESDKPLYHISKAIVQEWMTEQKIEGVIDQHR 
LYLFSLYLTETIFSSLPAIPIFIILNNQADVNLIKSIILRNFTDKVASVTGYNILISPPPSEEHLTEPLIIITTK 
EYLPYVKKQYPKGKHHFLTIALDLHVSQQRLIYQTIVDIRKEAFDKRVAMIAKKAHYLL 

ORF3_23FP is a cell wall surface protein. An example of an amino acid sequence of
ORF3_23FP is set forth in SEQ ID NO: 237.
SEQ ID NO: 237

EAQTGEAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTP
YQIIKVDGSEKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTTVETKEASTPLDVVILLD
NSNSMSNIRHNHAHRAEKAGEATRALVDKITSNPDNRVALVTYGSTIFDGSEATVEKGVADANGKILNDSALWTF 
DRTTFTAKTYNYSFLNLTSDPTDIQTIKDRIPSDAEELNKDKLMYQFGATFTQKALMTADDILTKQARPNSKKVI
FHITDGVPTMSYPINFKYTGTTQSYRTQLNNFKAKTPNSSGILLEDFVTWSADGEHKIVRGDGESYQMFTKKPVT 
DQYGVHQILSITSMEQRAKLVSAGYRFYGTDLYLYWRDSILAYPFNSSTDWITNHGDPTTWYYNGNMAQDGYDVF
TVGVGVNGDPGTDEATATRFMQSISSSPDNYTNVADPSQILQELNRYFYTIVNEKKSIENGTITDPMGELIDFQL
GADGRFDPADYTLTANDGSSLVNNVPTGGPQNDGGLLKNAKVFYDTTEKRIRVTGLYLGTGEKVTLTYNVRLNDQ 
FVSNKFYDTNGRTTLHPKEVEKNTVRDFPIPKIRDVRKYPEITIPKEKKLGEIEFIKINKNDKKPLRDAVFSLQK
QHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDGKYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVTSIV 
PQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIGCMMMGGVLLYTRKNP 

ORF4_23FP is a cell wall surface protein. An example of an amino acid sequence of 
ORF4_23FP is set forth in SEQ ID NO: 238.
SEQ ID NO: 238

MKSINKFLTMLAALLLTASSLFSAATVFAADNVSTAPDAVTKTLTIHKLLLSEDDLKTWDTNGPKGYDGTQSSLK 
DLTGVVAEEIPNVYFELQKYNLTDGKEKENLKDDSKWTTVHGGLTTKDGLKIETSTLKGVYRIREDRTKTTYVGP 
NGQVLTGSKAVPALVTLPLVNNNGTVIDAHVFPKNSYNKPVVDKRIADTLNYNDQNGLSIGTKIPYVVNTTIPSN 
ATFATSFWSDEMTEGLTYNEDVTITLNNVAMDQADYEVTKGINGFNLKLTEAGLAKINGKDADQKIQITYSATLN 
SLAVADIPESNDITYHYGNHQDHGNTPKPTKPNNGQITVTKTWDSQPAPEGVKATVQLVNAKTGEKVGAPVELSE
NNWTYTWSGLDNSIEYKVEEEYNGYSAEYTVESKGKLGVKNWKDNNPAPINLEEPRVKTYGKKFVKVDQKDTRLE 
NAQFVVKKADSNKYIAFKSTAQQAADEKAAATAKQKLDAAVAAYTNAADKQAAQALVDQAQQEYNVAYKEAKFGY
VEVAGKDEAMVLTSNTDGQFQISGLAAGTYKLEEIKAPEGFAKIDDVEFVVGAGSWNQGEFNYLKDVQKNDATKV 
VNKKITIPQTGGIGTIIFAVAGAVIMGIAVYAYVKNNKDEDQLA

ORF5_23FP is a cell wall surface protein. An example of an amino acid sequence of
ORF5_23FP is set forth in SEQ ID NO: 239. 
SEQ ID NO: 239 

MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDNRV 
QIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAK
KADTVTTKVKLIKVDQDHNRLEGVGFKLVSVARDGSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIVVTNLPLGT 
YRFKEVEPLAGYAVTTMDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEENGHYTPVL 
QNGKEVVVASGKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGE 
ETLYILMLVAILLFGSGYYLTKKTNN 

ORF6_23FP is a putative sortase. An example of an amino acid sequence of ORF6_23FP is 
set forth in SEQ ID NO: 240. 
SEQ ID NO: 240 

MLIKMAKTKKQKRNNLLLGVVFFIGIAVMAYPLVSRLYYRVESNQQIADFDKEKATLDEADIDERMKLAQAFNDS 
LNNVVSGDPWSEEMKKKGRAEYARMLEIHERMGHVEIPAIDVDLPVYAGTAEEVLQQGAGHLEGTSLPIGGNSTH
AVITAHTGLPTAKMFTDLTKLKVGDKFYVHNIKEVMAYQVDQVKVIEPTNFDDLLIVPGHDYVTLLTCTPYMINT 
HRLLVRGHRIPYVAEVEEEFIAANKLSHLYRYLFYVAVGLIVILLWIIRRLRKKKRQSERALKALKEATKEVKVE
DE 

ORF7_23FP is a putative sortase. An example of an amino acid sequence of ORF7_23FP is
set forth in SEQ ID NO: 241.
SEQ ID NO: 241 

MSKSRYSRKKSVKKKKNPFILLLIFLVGLAVAMYPLVSRYYYRIESNEVIKEFDETVSQMDKAELEERWRLAQAF 
NATLKPSEILDPFTEQEKKKGVSEYANMLKVHERIGYVEIPAIDQEIPMYVGTSEEILQKGAGLLEGASLPVGGE 
NTHTVVTAHRGLPTAELFSQLDPCMKKGDIFYLHVLDQVLAYQVDQIVTVEPNDFEPVLIQHGEDYATLLTCTPYM
INSHRLLVRGKRIPYTAPIAERNRAVRERGQFWLWLLLGAMAVILLLLYRVYRNRRIVKGLEKQLEGRHVKD 

ORF8_23FP is a putative sortase. An example of an amino acid sequence of ORF8_23FP is 
set forth in SEQ ID NO: 242.
SEQ ID NO: 242

MSRTKLRALLGYLLMLVACLIPIYCFGQMVLQSLGQVKGHATFVKSMTTEMYQEQQNHSLAYNQRLASQNRIVDP
FLAEGYEVNYQVSDDPDAVYGYLSIPSLEIMEPVYLGADYHHLGMGLAHVDGTPLPLDGTGIRSVIAGHRAEPSH 
VFFRHLDQLKVGDALYYDNGQEIVEYQMMDTEIILPSEWEKLESVSSKNIMTLITCDPIPTFNKRLLVNFERVAV 
YQKSDPQTAAVARVAFTKEGQSVSRVATSQWLYRGLVVLAFLGILFVLWKLARLLRGK

Immunogenic compositions of the invention comprising AI antigens may further comprise 
one or more antigenic agents. Preferred antigens include those listed below. Additionally, the 
compositions of the present invention may be used to treat or prevent infections caused by any of the 
below-listed microbes. Antigens for use in the immunogenic compositions include, but are not 
limited to, one or more of the following set forth below, or antigens derived from one or more of the 
following set forth below: 
Bacterial Antigens 

N. meningitidis: a protein antigen from N. meningitidis serogroup A, C, W135, Y, and/or B
(1-7); an outer-membrane vesicle (OMV) preparation from N. meningitidis serogroup B (8, 9, 10,
11); a saccharide antigen, including LPS, from N. meningitidis serogroup A, B, C, W135 and/or Y,
such as the oligosaccharide from serogroup C (see PCT/US99/09346; PCT/IB98/01665; and
PCT/IB99/00103);

Streptococcus pneumoniae: a saccharide or protein antigen, particularly a saccharide from
Streptococcus pneumoniae;

Streptococcus agalactiae: particularly, Group B streptococcus antigens;

Streptococcus pyogenes: particularly, Group A streptococcus antigens;

Enterococcus faecalis or Enterococcus faecium: particularly a trisaccharide repeat or other
Enterococcus-derived antigens provided in US Patent No. 6,756,361;

Helicobacter pylori: including: Cag, Vac, Nap, HopX, HopY and/or urease antigen; 

Bordetella pertussis: such as pertussis holotoxin (PT) and filamentous haemagglutinin (FHA)
from B. pertussis, optionally also in combination with pertactin and/or agglutinogens 2 and 3 antigen;

Staphylococcus aureus: including S. aureus type 5 and 8 capsular polysaccharides optionally 
conjugated to nontoxic recombinant Pseudomonas aeruginosa exotoxin A, such as StaphVAX™, or 
antigens derived from surface proteins, invasins (leukocidin, kinases, hyaluronidase), surface factors 
that inhibit phagocytic engulfment (capsule, Protein A), carotenoids, catalase production, Protein A, 
coagulase, clotting factor, and/or membrane-damaging toxins (optionally detoxified) that lyse
eukaryotic cell membranes (hemolysins, leukotoxin, leukocidin); 

Staphylococcus epidermidis: particularly, S. epidermidis slime-associated antigen (SAA);

Staphylococcus saprophyticus (causing urinary tract infections): particularly the 160 kDa
hemagglutinin of S. saprophyticus antigen;

Pseudomonas aeruginosa: particularly, endotoxin A, Wzz protein, P. aeruginosa LPS, more
particularly LPS isolated from PAO1 (O5 serotype), and/or Outer Membrane Proteins, including
Outer Membrane Proteins F (OprF) (Infect Immun. 2001 May; 69(5): 3510-3515);

Bacillus anthracis (anthrax): such as B. anthracis antigens (optionally detoxified) from
A-components (lethal factor (LF) and edema factor (EF)), both of which can share a common
B-component known as protective antigen (PA);

Moraxella catarrhalis (respiratory): including outer membrane protein antigens (HMW-OMP),
C-antigen, and/or LPS;

Yersinia pestis (plague): such as F1 capsular antigen (Infect Immun. 2003 Jan; 71(1): 374-383),
LPS (Infect Immun. 1999 Oct; 67(10): 5395), Yersinia pestis V antigen (Infect Immun. 1997
Nov; 65(11): 4476-4482); 

Yersinia enterocolitica (gastrointestinal pathogen): particularly LPS (Infect Immun. 2002
August; 70(8): 4414);

Yersinia pseudotuberculosis: gastrointestinal pathogen antigens; 

Mycobacterium tuberculosis: such as lipoproteins, LPS, BCG antigens, a fusion protein of 
antigen 85B (Ag85B) and/or ESAT-6 optionally formulated in cationic lipid vesicles (Infect Immun.
2004 October; 72(10): 6148), Mycobacterium tuberculosis (Mtb) isocitrate dehydrogenase associated
antigens (Proc Natl Acad Sci USA. 2004 Aug 24; 101(34): 12652), and/or MPT51 antigens
(Infect Immun. 2004 July; 72(7): 3829); 

Legionella pneumophila (Legionnaires' Disease): L. pneumophila antigens, optionally
derived from cell lines with disrupted asd genes (Infect Immun. 1998 May; 66(5): 1898);

Rickettsia: including outer membrane proteins, including the outer membrane protein A 
and/or B (OmpB) (Biochim Biophys Acta. 2004 Nov 1; 1702(2): 145), LPS, and surface protein antigen
(SPA) (J Autoimmun. 1989 Jun; 2 Suppl: 81);

E. coli: including antigens from enterotoxigenic E. coli (ETEC), enteroaggregative E. coli 
(EAggEC), diffusely adhering E. coli (DAEC), enteropathogenic E. coli (EPEC), and/or 
enterohemorrhagic E. coli (EHEC); 

Vibrio cholerae: including proteinase antigens, LPS, particularly lipopolysaccharides of
Vibrio cholerae II, O1 Inaba O-specific polysaccharides, V. cholerae O139, antigens of IEM108
vaccine (Infect Immun. 2003 Oct;71(10):5498-504), and/or Zonula occludens toxin (Zot);

Salmonella typhi (typhoid fever): including capsular polysaccharides preferably conjugates 
(Vi, i.e. vax-TyVi); 

Salmonella typhimurium (gastroenteritis): antigens derived therefrom are contemplated for
microbial and cancer therapies, including angiogenesis inhibition and modulation of flk;

Listeria monocytogenes (systemic infections in immunocompromised or elderly people,
infections of fetus): antigens derived from L. monocytogenes are preferably used as carriers/vectors
for intracytoplasmic delivery of conjugates/associated compositions of the present invention;

Porphyromonas gingivalis: particularly, P. gingivalis outer membrane protein (OMP);

Tetanus: such as tetanus toxoid (TT) antigens, preferably used as a carrier protein in 
conjunction/conjugated with the compositions of the present invention;

modulating, inhibiting or associated with ADP ribosylation are contemplated for combination/
co-administration/conjugation with the compositions of the present invention; the diphtheria toxoids are
preferably used as carrier proteins;

Borrelia burgdorferi (Lyme disease): such as antigens associated with P39 and P13 (an
integral membrane protein, Infect Immun. 2001 May; 69(5): 3323-3334), VlsE Antigenic Variation
Protein (J Clin Microbiol. 1999 Dec; 37(12): 3997); 

Haemophilus influenzae B: such as a saccharide antigen therefrom; 

Klebsiella: such as an OMP, including OMP A, or a polysaccharide optionally conjugated to
tetanus toxoid;

Neisseria gonorrhoeae: including a Por (or porin) protein, such as PorB (see Zhu et al.,
Vaccine (2004) 22:660-669), a transferrin binding protein, such as TbpA and TbpB (see Price et
al., Infection and Immunity (2004) 71(1):277-283), an opacity protein (such as Opa), a
reduction-modifiable protein (Rmp), and outer membrane vesicle (OMV) preparations (see Plante et al., J
Infectious Disease (2000) 182:848-855); see also, e.g., WO99/24578, WO99/36544, WO99/57280, and
WO02/079243;

Chlamydia pneumoniae: particularly C. pneumoniae protein antigens; 

Chlamydia trachomatis: including antigens derived from serotypes A, B, Ba and C
(agents of trachoma, a cause of blindness), serotypes L1, L2 & L3 (associated with lymphogranuloma
venereum), and serotypes D-K;

Treponema pallidum (Syphilis): particularly a TmpA antigen; and

Haemophilus ducreyi (causing chancroid): including outer membrane protein (DsrA).

Where not specifically referenced, further bacterial antigens of the invention may be capsular
antigens, polysaccharide antigens or protein antigens of any of the above. Further bacterial antigens
may also include an outer membrane vesicle (OMV) preparation. Additionally, antigens include live,
attenuated, split, and/or purified versions of any of the aforementioned bacteria. The bacterial or 
microbial derived antigens of the present invention may be gram-negative or gram-positive and 
aerobic or anaerobic. 

Additionally, any of the above bacterial-derived saccharides (polysaccharides, LPS, LOS or
oligosaccharides) can be conjugated to another agent or antigen, such as a carrier protein (for example
CRM197). Such conjugation may be direct conjugation effected by reductive amination of carbonyl
moieties on the saccharide to amino groups on the protein, as provided in US Patent No. 5,360,897
and Can J Biochem Cell Biol. 1984 May;62(5):270-5. Alternatively, the saccharides can be
conjugated through a linker, such as with succinamide or other linkages provided in Bioconjugate
Techniques, 1996 and CRC, Chemistry of Protein Conjugation and Cross-Linking, 1993.

Viral Antigens

Influenza: including whole viral particles (attenuated), split, or subunit comprising
hemagglutinin (HA) and/or neuraminidase (NA) surface proteins; the influenza antigens may be
derived from chicken embryos or propagated on cell culture, and/or the influenza antigens are derived
from influenza type A, B, and/or C, among others;

Respiratory syncytial virus (RSV): including the F protein of the A2 strain of RSV (J Gen
Virol. 2004 Nov; 85(Pt 11):3229) and/or G glycoprotein;

Parainfluenza virus (PIV): including PIV type 1, 2, and 3, preferably containing
hemagglutinin, neuraminidase and/or fusion glycoproteins; 

Poliovirus: including antigens from a family of picornaviridae, preferably poliovirus antigens
such as OPV or, preferably, IPV;

Measles: including split measles virus (MV) antigen optionally combined with the Protollin
and/or antigens present in MMR vaccine;

Mumps: including antigens present in MMR vaccine;

Rubella: including antigens present in MMR vaccine as well as other antigens from
Togaviridae, including dengue virus;

Rabies: such as lyophilized inactivated virus (RabAvert™);

Flaviviridae viruses: such as (and antigens derived therefrom) yellow fever virus, Japanese
encephalitis virus, dengue virus (types 1, 2, 3, or 4), tick borne encephalitis virus, and West Nile
virus;

Caliciviridae: antigens therefrom;

HIV: including HIV-1 or HIV-2 strain antigens, such as gag (p24gag and p55gag), env
(gp160 and gp41), pol, tat, nef, rev, vpu, miniproteins (preferably p55 gag and gp140v delete), and
antigens from the isolates HIV-IIIb, HIV-SF2, HIV-LAV, HIV-LAI, HIV-MN, HIV-1 CM235, HIV-1 US4, HIV-2;
simian immunodeficiency virus (SIV) among others;

Rotavirus: including VP4, VP5, VP6, VP7, VP8 proteins (Protein Expr Purif. 2004
Dec;38(2):205) and/or NSP4;

Pestivirus: such as antigens from classical porcine fever virus, bovine viral diarrhoea virus, 
and/or border disease virus;

Parvovirus: such as parvovirus B19;

Coronavirus: including SARS virus antigens, particularly spike protein or proteases 
therefrom, as well as antigens included in WO 04/92360; 

Hepatitis A virus: such as inactivated virus; 

Hepatitis B virus: such as the surface and/or core antigens (sAg), as well as the presurface
sequences, pre-S1 and pre-S2 (formerly called pre-S), as well as combinations of the above, such as
sAg/pre-S1, sAg/pre-S2, sAg/pre-S1/pre-S2, and pre-S1/pre-S2 (see, e.g., AHBV Vaccines - Human
Vaccines and Vaccination, pp. 159-176; and U.S. Patent Nos. 4,722,840, 5,098,704, 5,324,513;
Birnbaum et al., J. Virol. (1990) 64:3319-3330; and
Zhou et al., J. Virol. (1991) 65:5457-5464);

Hepatitis C virus: such as E1, E2, E1/E2 (see Houghton et al., Hepatology (1991) 14:381),
NS345 polyprotein, NS345-core polyprotein, core, and/or peptides from the nonstructural regions
(International Publication Nos. WO 89/04669; WO 90/11089; and WO 90/14436);

Delta hepatitis virus (HDV): antigens derived therefrom, particularly S-antigen from HDV 
(see, e.g., U.S. Patent No. 5,378,814); 

Hepatitis E virus (HEV): antigens derived therefrom;

Hepatitis G virus (HGV): antigens derived therefrom;

Varicella zoster virus: antigens derived from varicella zoster virus (VZV) (J. Gen. Virol.
(1986) 67:1759);

Epstein-Barr virus: antigens derived from EBV (Baer et al., Nature (1984) 310:207);

Cytomegalovirus: CMV antigens, including gB and gH (Cytomegaloviruses (J.K. McDougall,
ed., Springer-Verlag 1990) pp. 125-169);

Herpes simplex virus: including antigens from HSV-1 or HSV-2 strains and glycoproteins gB,
gD and gH (McGeoch et al., J. Gen. Virol. (1988) 69:1531 and U.S. Patent No. 5,171,568);

Human Herpes Virus: antigens derived from other human herpesviruses such as HHV6 and
HHV7; and 

HPV: including antigens associated with or derived from human papillomavirus (HPV), for
example, one or more of E1-E7, L1, L2, and fusions thereof; particularly, the compositions of the
invention may include a virus-like particle (VLP) comprising the L1 major capsid protein; more
particularly still, the HPV antigens are protective against one or more of HPV serotypes 6, 11, 16 and/or
18.

Further provided are antigens, compositions, methods, and microbes included in Vaccines, 4th
Edition (Plotkin and Orenstein ed. 2004); Medical Microbiology 4th Edition (Murray et al. ed. 2002);
Virology, 3rd Edition (W.K. Joklik ed. 1988); Fundamental Virology, 2nd Edition (B.N. Fields and 
D.M. Knipe, eds. 1991), which are contemplated in conjunction with the compositions of the present 
invention. 

Additionally, antigens include live, attenuated, split, and/or purified versions of any of the
aforementioned viruses.

Fungal Antigens

Fungal antigens for use herein, associated with vaccines include those described in: U.S. Pat.
Nos. 4,229,434 and 4,368,191 for prophylaxis and treatment of trichophytosis caused by Trichophyton
mentagrophytes; U.S. Pat. Nos. 5,277,904 and 5,284,652 for a broad spectrum dermatophyte vaccine
for the prophylaxis of dermatophyte infection in animals, such as guinea pigs, cats, rabbits, horses and
lambs, in which the antigens comprise a suspension of killed T. equinum, T. mentagrophytes (var.
granulare), M. canis and/or M. gypseum in an effective amount optionally combined with an adjuvant;
homogenized, formaldehyde-killed fungi, i.e., Microsporum canis culture in a carrier; U.S. Pat. No.
5,948,413 involving extracellular and intracellular proteins for pythiosis. Additional antigens 
identified within antifungal vaccines include Ringvac bovis LTF-130 and Bioveta. 

Further, fungal antigens for use herein may be derived from Dermatophytes, including:
Epidermophyton floccosum, Microsporum audouini, Microsporum canis, Microsporum distortum,
Microsporum equinum, Microsporum gypseum, Microsporum nanum, Trichophyton concentricum,
Trichophyton equinum, Trichophyton gallinae, Trichophyton gypseum, Trichophyton megnini,
Trichophyton mentagrophytes, Trichophyton quinckeanum, Trichophyton rubrum, Trichophyton
schoenleini, Trichophyton tonsurans, Trichophyton verrucosum, T. verrucosum var. album, var.
discoides, var. ochraceum, Trichophyton violaceum, and/or Trichophyton faviforme.

Fungal pathogens for use as antigens or in derivation of antigens in conjunction with the
compositions of the present invention comprise Aspergillus fumigatus, Aspergillus flavus, Aspergillus
niger, Aspergillus nidulans, Aspergillus terreus, Aspergillus sydowi, Aspergillus flavatus, Aspergillus
glaucus, Blastoschizomyces capitatus, Candida albicans, Candida enolase, Candida tropicalis,
Candida glabrata, Candida krusei, Candida parapsilosis, Candida stellatoidea, Candida kusei,
Candida parakrusei, Candida lusitaniae, Candida pseudotropicalis, Candida guilliermondi,
Cladosporium carrionii, Coccidioides immitis, Blastomyces dermatitidis, Cryptococcus neoformans,
Geotrichum clavatum, Histoplasma capsulatum, Klebsiella pneumoniae, Paracoccidioides
brasiliensis, Pneumocystis carinii, Pythium insidiosum, Pityrosporum ovale, Saccharomyces
cerevisiae, Saccharomyces boulardii, Saccharomyces pombe, Scedosporium apiospermum, Sporothrix
schenckii, Trichosporon beigelii, Toxoplasma gondii, Penicillium marneffei, Malassezia spp.,
Fonsecaea spp., Wangiella spp., Sporothrix spp., Basidiobolus spp., Conidiobolus spp., Rhizopus spp.,
Mucor spp., Absidia spp., Mortierella spp., Cunninghamella spp., and Saksenaea spp.

Other fungi from which antigens are derived include Alternaria spp., Curvularia spp.,
Helminthosporium spp., Fusarium spp., Aspergillus spp., Penicillium spp., Monolinia spp., Rhizoctonia
spp., Paecilomyces spp., Pithomyces spp., and Cladosporium spp.

Processes for producing fungal antigens are well known in the art (see US Patent No.
6,333,164). In a preferred method, a solubilized fraction is extracted and separated from an insoluble
fraction obtainable from fungal cells whose cell wall has been substantially removed or at least
partially removed. The process comprises the steps of: obtaining living fungal cells; obtaining fungal
cells whose cell wall has been substantially removed or at least partially removed; bursting the fungal
cells whose cell wall has been substantially removed or at least partially removed; obtaining an
insoluble fraction; and extracting and separating a solubilized fraction from the insoluble fraction.

In particular embodiments, microbes (bacteria, viruses and/or fungi) against which the present
compositions and methods can be implemented include those that cause sexually transmitted diseases
(STDs) and/or those that display on their surface an antigen that can be the target or antigen
composition of the invention. In a preferred embodiment of the invention, compositions are combined
with antigens derived from a viral or bacterial STD. Antigens derived from bacteria or viruses can be
administered in conjunction with the compositions of the present invention to provide protection
against at least one of the following STDs, among others: chlamydia, genital herpes, hepatitis
(particularly HCV), genital warts, gonorrhoea, syphilis and/or chancroid (see WO00/15255).

In another embodiment the compositions of the present invention are co-administered with an
antigen for the prevention or treatment of an STD.

Antigens derived from the following viruses associated with STDs, which are described in
greater detail above, are preferred for co-administration with the compositions of the present
invention: hepatitis (particularly HCV), HPV, HIV, or HSV.

Additionally, antigens derived from the following bacteria associated with STDs, which are
described in greater detail above, are preferred for co-administration with the compositions of the
present invention: Neisseria gonorrhoeae, Chlamydia pneumoniae, Chlamydia trachomatis,
Treponema pallidum, or Haemophilus ducreyi.
Respiratory Antigens

The antigen may be a respiratory antigen and could further be used in an immunogenic
composition for methods of preventing and/or treating infection by a respiratory pathogen, including a
virus, bacterium, or fungus, such as respiratory syncytial virus (RSV), PIV, SARS virus, influenza, or
Bacillus anthracis, particularly by reducing or preventing infection and/or one or more symptoms of
respiratory virus infection. A composition comprising an antigen described herein, such as one
derived from a respiratory virus, bacterium or fungus, is administered in conjunction with the
compositions of the present invention to an individual who is at risk of being exposed to that
particular respiratory microbe, has been exposed to a respiratory microbe, or is infected with a
respiratory virus, bacterium or fungus. The composition(s) of the present invention is/are preferably
co-administered at the same time or in the same formulation with an antigen of the respiratory pathogen.
Administration of the composition results in reduced incidence and/or severity of one or more
symptoms of respiratory infection.
Pediatric/Geriatric Antigens 

In one embodiment the compositions of the present invention are used in conjunction with an
antigen for treatment of a pediatric population, as in a pediatric antigen. In a more particular
embodiment the pediatric population is less than about 3 years old, or less than about 2 years old, or
less than about 1 year old. In another embodiment the pediatric antigen (in conjunction with the
composition of the present invention) is administered multiple times over at least 1, 2, or 3 years.

an antigen for treatment of a geriatric population, as in a geriatric antigen.
Other Antigens 

Other antigens for use in conjunction with the compositions of the present invention include
hospital acquired (nosocomial) associated antigens.

In another embodiment, parasitic antigens are contemplated in conjunction with the
compositions of the present invention. Examples of parasitic antigens include those derived from
organisms causing malaria and/or Lyme disease.

In another embodiment, the antigens in conjunction with the compositions of the present
invention are associated with or effective against a mosquito-borne illness. In another embodiment, the
antigens in conjunction with the compositions of the present invention are associated with or effective
against encephalitis. In another embodiment the antigens in conjunction with the compositions of the
present invention are associated with or effective against an infection of the nervous system.

In another embodiment, the antigens in conjunction with the compositions of the present
invention are antigens transmissible through blood or body fluids.
Antigen Formulations 

In other aspects of the invention, methods of producing microparticles
having adsorbed antigens are provided. The methods comprise: (a) providing an emulsion by
dispersing a mixture comprising (i) water, (ii) a detergent, (iii) an organic solvent, and (iv) a
biodegradable polymer selected from the group consisting of a poly(α-hydroxy acid), a
polyhydroxybutyric acid, a polycaprolactone, a polyorthoester, a polyanhydride, and a
polycyanoacrylate, wherein the polymer is typically present in the mixture at a concentration of about
1% to about 30% relative to the organic solvent, and the detergent is typically present in the mixture
at a weight-to-weight detergent-to-polymer ratio of from about 0.00001:1 to about 0.1:1 (more
typically about 0.0001:1 to about 0.1:1, about 0.001:1 to about 0.1:1, or about 0.005:1 to about 0.1:1);
(b) removing the organic solvent from the emulsion; and (c) adsorbing an antigen on the surface of the
microparticles. In certain embodiments, the biodegradable polymer is present at a concentration of
about 3% to about 10% relative to the organic solvent.
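
By way of illustration only, the proportions recited above can be turned into a small calculation. The following Python sketch makes two assumptions that are not stated in the text: the polymer concentration is read as weight per volume (grams per 100 mL of organic solvent), and the "about" limits are treated as hard bounds. The function names are hypothetical.

```python
# Illustrative sketch only. Assumptions (not from the specification):
#  - "% relative to the organic solvent" is read as w/v (g per 100 mL);
#  - the "about" limits are treated as hard bounds for simplicity.

POLYMER_PCT_RANGE = (1.0, 30.0)        # about 1% to about 30%
DETERGENT_RATIO_RANGE = (0.00001, 0.1)  # about 0.00001:1 to about 0.1:1 (w/w)


def polymer_mass_g(solvent_volume_ml: float, polymer_pct: float) -> float:
    """Mass of polymer (g) for a given solvent volume and w/v percentage."""
    low, high = POLYMER_PCT_RANGE
    if not low <= polymer_pct <= high:
        raise ValueError(f"polymer concentration {polymer_pct}% outside {low}-{high}%")
    return solvent_volume_ml * polymer_pct / 100.0


def detergent_mass_g(polymer_g: float, ratio: float) -> float:
    """Mass of detergent (g) for a given detergent-to-polymer w/w ratio."""
    low, high = DETERGENT_RATIO_RANGE
    if not low <= ratio <= high:
        raise ValueError(f"detergent:polymer ratio {ratio} outside {low}-{high}")
    return polymer_g * ratio


if __name__ == "__main__":
    polymer = polymer_mass_g(solvent_volume_ml=10.0, polymer_pct=6.0)
    detergent = detergent_mass_g(polymer, ratio=0.001)
    print(f"polymer: {polymer:.3f} g, detergent: {detergent:.6f} g")
```

For example, under these assumptions 10 mL of organic solvent at 6% polymer corresponds to 0.6 g of polymer, and a 0.001:1 ratio corresponds to 0.6 mg of detergent.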

Microparticles for use herein will be formed from materials that are
sterilizable, non-toxic and biodegradable. Such materials include, without limitation, poly(α-hydroxy
acid), polyhydroxybutyric acid, polycaprolactone, polyorthoester, polyanhydride, PACA, and
polycyanoacrylate. Preferably, microparticles for use with the present invention are derived from a
poly(α-hydroxy acid), in particular, from a poly(lactide) ("PLA") or a copolymer of D,L-lactide and
glycolide or glycolic acid, such as a poly(D,L-lactide-co-glycolide) ("PLG" or "PLGA"), or a
copolymer of D,L-lactide and caprolactone. The microparticles may be derived from any of various
polymeric starting materials which have a variety of molecular weights and, in the case of the
copolymers such as PLG, a variety of lactide:glycolide ratios, the selection of which will be largely a
matter of choice, depending in part on the coadministered macromolecule. These parameters are
discussed more fully below.

Further antigens may also include an outer membrane vesicle (OMV) preparation.
Additional formulation methods and antigens (especially tumor antigens) are provided in U.S.
Patent Serial No. 09/581,772.
Antigen References 

The following references include antigens useful in conjunction with the compositions of
the present invention:

1 International patent application WO99/24578.
2 International patent application WO99/36544.
3 International patent application WO99/57280.
4 International patent application WO00/22430.
5 Tettelin et al. (2000) Science 287:1809-1815.
6 International patent application WO96/29412.
7 Pizza et al. (2000) Science 287:1816-1820.
8 PCT WO 01/52885.
9 Bjune et al. (1991) Lancet 338(8775).
10 Fukasawa et al. (1999) Vaccine 17:2951-2958.
11 Rosenqvist et al. (1998) Dev. Biol. Stand. 92:323-333.
12 Constantino et al. (1992) Vaccine 10:691-698.
13 Constantino et al. (1999) Vaccine 17:1251-1263.
14 Watson (2000) Pediatr Infect Dis J 19:331-332.
15 Rubin (2000) Pediatr Clin North Am 47:269-285, v.
16 Jedrzejas (2001) Microbiol Mol Biol Rev 65:187-207.
17 International patent application filed on 3rd July 2001 claiming priority from GB-0016363.4; WO 02/02606; PCT IB/01/00166.
18 Kalman et al. (1999) Nature Genetics 21:385-389.
19 Read et al. (2000) Nucleic Acids Res 28:1397-406.
20 Shirai et al. (2000) J. Infect. Dis 181(Suppl 3):S524-S527.
21 International patent application WO99/27105.
22 International patent application WO00/27994.
23 International patent application WO00/37494.
24 International patent application WO99/28475.
25 Bell (2000) Pediatr Infect Dis J 19:1187-1188.
26 Iwarson (1995) APMIS 103:321-326.
27 Gerlich et al. (1990) Vaccine 8 Suppl:S63-68 & 79-80.
28 Hsu et al. (1999) Clin Liver Dis 3:901-915.
29 Gustafsson et al. (1996) N. Engl. J. Med. 334:349-355.
30 Rappuoli et al. (1991) TIBTECH 9:232-238.
31 Vaccines (1988) eds. Plotkin & Mortimer. ISBN 0-7216-1946-0.
32 Del Guidice et al. (1998) Molecular Aspects of Medicine 19:1-70.
33 International patent application WO93/018150.
34 International patent application WO99/53310.
35 International patent application WO98/04702.
36 Ross et al. (2001) Vaccine 19:135-142.
37 Sutter et al. (2000) Pediatr Clin North Am 47:287-308.
38 Zimmerman & Spann (1999) Am Fam Physician 59:113-118, 125-126.
39 Dreesen (1997) Vaccine 15 Suppl:S2-6.
40 MMWR Morb Mortal Wkly Rep 1998 Jan 16;47(1):12, 9.
41 McMichael (2000) Vaccine 19 Suppl 1:S101-107.
GB patent applications 0028727.6 & 0105640.7.

44 Dale (1999) Infect Dis Clin North Am 13:227-43, viii.
45 Ferretti et al. (2001) PNAS USA 98:4658-4663.
46 Kuroda et al. (2001) Lancet 357(9264):1225-1240; see also pages 1218-1219.
47 Ramsay et al. (2001) Lancet 357(9251):195-196.
48 Lindberg (1999) Vaccine 17 Suppl. 2:S28-36.
49 Buttery & Moxon (2000) J R Coll Physicians Lond 34:163-168.
50 Ahmad & Chapnick (1999) Infect Dis Clin North Am 13:113-133, vii.
51 Goldblatt (1998) J. Med. Microbiol. 47:563-567.
52 European patent 0 477 508.
53 U.S. Patent No. 5,306,492.
54 International patent application WO98/42721.
55 Conjugate Vaccines (eds. Cruse et al.) ISBN 3805549326, particularly vol. 10:48-114.
56 Hermanson (1996) Bioconjugate Techniques ISBN: 012323368 & 012342335X.
57 European patent application 0372501.
58 European patent application 0378881.
59 European patent application 0427347.
60 International patent application WO93/17712.
61 International patent application WO98/58668.
62 European patent application 0471177.
63 International patent application WO00/56360.
64 International patent application WO00/67161.

The contents of all of the above cited patents, patent applications and journal articles are
incorporated by reference as if set forth fully herein.

There may be an upper limit to the number of Gram positive bacterial proteins which will be
in the compositions of the invention. Preferably, the number of Gram positive bacterial proteins in a
composition of the invention is less than 20, less than 19, less than 18, less than 17, less than 16, less
than 15, less than 14, less than 13, less than 12, less than 11, less than 10, less than 9, less than 8, less
than 7, less than 6, less than 5, less than 4, or less than 3. Still more preferably, the number of Gram
positive bacterial proteins in a composition of the invention is less than 6, less than 5, or less than 4.
Still more preferably, the number of Gram positive bacterial proteins in a composition of the
invention is 3.

The Gram positive bacterial proteins and polynucleotides used in the invention are preferably
isolated, i.e., separate and discrete from the whole organism with which the molecule is found in
nature or, when the polynucleotide or polypeptide is not found in nature, sufficiently free of other
biological macromolecules so that the polynucleotide or polypeptide can be used for its intended
purpose.

Fusion Proteins: GBS AI sequences

The GBS AI proteins used in the invention may be present in the composition as individual
separate polypeptides, but it is preferred that at least two (i.e. 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14,
15, 16, 17, or 18) of the antigens are expressed as a single polypeptide chain (a "hybrid" or "fusion"
polypeptide). Such fusion polypeptides offer two principal advantages: first, a polypeptide that may
be unstable or poorly expressed on its own can be assisted by adding a suitable fusion partner that
overcomes the problem; second, commercial manufacture is simplified as only one expression and
purification need be employed in order to produce two polypeptides which are both antigenically
useful.

The fusion polypeptide may comprise one or more AI polypeptide sequences. Preferably, the
fusion comprises an AI surface protein sequence. Preferably, the fusion polypeptide includes one or
more of GBS 80, GBS 104, and GBS 67. Most preferably, the fusion peptide includes a polypeptide
sequence from GBS 80. Accordingly, the invention includes a fusion peptide comprising a first
amino acid sequence and a second amino acid sequence, wherein said first and second amino acid
sequences are selected from a GBS AI surface protein or a fragment thereof. Preferably, the first and
second amino acid sequences in the fusion polypeptide comprise different epitopes.

Hybrids (or fusions) consisting of amino acid sequences from two, three, four, five, six,
seven, eight, nine, or ten GBS antigens are preferred. In particular, hybrids consisting of amino acid
sequences from two, three, four, or five GBS antigens are preferred.

Different hybrid polypeptides may be mixed together in a single formulation. Within such
combinations, a GBS antigen may be present in more than one hybrid polypeptide and/or as a
non-hybrid polypeptide. It is preferred, however, that an antigen is present either as a hybrid or as a
non-hybrid, but not as both.

Hybrid polypeptides can be represented by the formula NH2-A-{-X-L-}n-B-COOH, wherein:
X is an amino acid sequence of a GBS AI protein or a fragment thereof; L is an optional linker amino
acid sequence; A is an optional N-terminal amino acid sequence; B is an optional C-terminal amino
acid sequence; and n is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 or 15.

If a -X- moiety has a leader peptide sequence in its wild-type form, this may be included or
omitted in the hybrid protein. In some embodiments, the leader peptides will be deleted except for
that of the -X- moiety located at the N-terminus of the hybrid protein, i.e. the leader peptide of X1 will
be retained, but the leader peptides of X2 ... Xn will be omitted. This is equivalent to deleting all
leader peptides and using the leader peptide of X1 as moiety -A-.

For each of the n instances of {-X-L-}, linker amino acid sequence -L- may be present or absent.
For instance, when n=2 the hybrid may be NH2-X1-L1-X2-L2-COOH, NH2-X1-X2-COOH,
NH2-X1-L1-X2-COOH, NH2-X1-X2-L2-COOH, etc. Linker amino acid sequence(s) -L- will typically
be short (e.g. 20 or fewer amino acids i.e. 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1).
Examples comprise short peptide sequences which facilitate cloning, poly-glycine linkers (i.e.
comprising (Gly)n where n = 2, 3, 4, 5, 6, 7, 8, 9, 10 or more), and histidine tags (i.e. (His)n where n = 3,
4, 5, 6, 7, 8, 9, 10 or more). Other suitable linker amino acid sequences will be apparent to those
skilled in the art. A useful linker is GSGGGG, with the Gly-Ser dipeptide being formed from a
BamHI restriction site, thus aiding cloning and manipulation, and the (Gly)4 tetrapeptide being a
typical poly-glycine linker.
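
The formula NH2-A-{-X-L-}n-B-COOH can be read as a simple concatenation rule. The Python sketch below is offered only as an illustration of that rule under stated assumptions: the antigen strings are arbitrary placeholders and not sequences from the invention, the names Unit and build_hybrid are hypothetical, and the "typical" length limits (20 residues for -L-, 40 for -A- and -B-) are enforced as hard limits purely for simplicity. Only the GSGGGG linker and the histidine tag are taken from the text.

```python
from dataclasses import dataclass
from typing import Sequence

MAX_LINKER_LEN = 20    # "-L- will typically be short (e.g. 20 or fewer amino acids)"
MAX_TERMINAL_LEN = 40  # typical upper length for -A- and -B- (assumed hard limit here)


@dataclass(frozen=True)
class Unit:
    """One {-X-L-} repeat: an antigen sequence X and an optional linker L."""
    x: str
    linker: str = ""  # empty string means the linker is absent


def build_hybrid(units: Sequence[Unit], a: str = "", b: str = "") -> str:
    """Concatenate A, the n {-X-L-} repeats, and B, written N- to C-terminus."""
    if not 2 <= len(units) <= 15:
        raise ValueError("n must be between 2 and 15")
    if len(a) > MAX_TERMINAL_LEN or len(b) > MAX_TERMINAL_LEN:
        raise ValueError("-A- and -B- are limited to 40 residues in this sketch")
    for unit in units:
        if not unit.x:
            raise ValueError("each -X- moiety must be non-empty")
        if len(unit.linker) > MAX_LINKER_LEN:
            raise ValueError("-L- is limited to 20 residues in this sketch")
    return a + "".join(unit.x + unit.linker for unit in units) + b


if __name__ == "__main__":
    # Placeholder antigen fragments (hypothetical); GSGGGG linker; His6 tag as -B-.
    hybrid = build_hybrid(
        [Unit("MKTAYIAK", linker="GSGGGG"), Unit("QRQISFVK")],
        b="HHHHHH",
    )
    print(hybrid)
```

Here n = 2, the first repeat carries a linker and the second does not, which corresponds to the NH2-X1-L1-X2-COOH arrangement with a C-terminal histidine tag as -B-.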

-A- is an optional N-terminal amino acid sequence. This will typically be short (e.g. 40 or
fewer amino acids i.e. 39, 38, 37, 36, 35, 34, 33, 32, 31, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 19,
18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1). Examples include leader sequences to direct
protein trafficking, or short peptide sequences which facilitate cloning or purification (e.g. histidine
tags i.e. (His)n where n = 3, 4, 5, 6, 7, 8, 9, 10 or more). Other suitable N-terminal amino acid
sequences will be apparent to those skilled in the art. If X1 lacks its own N-terminus methionine, -A-
is preferably an oligopeptide (e.g. with 1, 2, 3, 4, 5, 6, 7 or 8 amino acids) which provides an
N-terminus methionine.

-B- is an optional C-terminal amino acid sequence. This will typically be short (e.g. 40 or
fewer amino acids i.e. 39, 38, 37, 36, 35, 34, 33, 32, 31, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 19,
18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1). Examples include sequences to direct
protein trafficking, short peptide sequences which facilitate cloning or purification (e.g. comprising
histidine tags i.e. (His)n where n = 3, 4, 5, 6, 7, 8, 9, 10 or more), or sequences which enhance protein
stability. Other suitable C-terminal amino acid sequences will be apparent to those skilled in the art.

Most preferably, n is 2 or 3. 
Fusion Proteins: Gram positive bacteria AI sequences 

The Gram positive bacteria AI proteins used in the invention may be present in the 
composition as individual separate polypeptides, but it is preferred that at least two (i.e. 2, 3, 4, 5, 6, 7, 
8, 9, 10, 11, 12, 13, 14, 15, 16, 17, or 18) of the antigens are expressed as a single polypeptide chain 
(a "hybrid" or "fusion" polypeptide). Such fusion polypeptides offer two principal advantages: first, a 
polypeptide that may be unstable or poorly expressed on its own can be assisted by adding a suitable 
fusion partner that overcomes the problem; second, commercial manufacture is simplified as only one 
expression and purification need be employed in order to produce two polypeptides which are both 
antigenically useful. 

The fusion polypeptide may comprise one or more AI polypeptide sequences. Preferably, the 
fusion comprises an AI surface protein sequence. Accordingly, the invention includes a fusion 
peptide comprising a first amino acid sequence and a second amino acid sequence, wherein said first 
and second amino acid sequences are selected from a Gram positive bacteria AI protein or a fragment 
thereof. Preferably, the first and second amino acid sequences in the fusion polypeptide comprise 
different epitopes. 

Hybrids (or fusions) consisting of amino acid sequences from two, three, four, five, six, 
seven, eight, nine, or ten Gram positive bacteria antigens are preferred. In particular, hybrids 
consisting of amino acid sequences from two, three, four, or five Gram positive bacteria antigens are 
preferred. 

Different hybrid polypeptides may be mixed together in a single formulation. Within such 
combinations, a Gram positive bacteria AI sequence may be present in more than one hybrid 
polypeptide and/or as a non-hybrid polypeptide. It is preferred, however, that an antigen is present 
either as a hybrid or as a non-hybrid, but not as both. 

Hybrid polypeptides can be represented by the formula NH2-A-{-X-L-}n-B-COOH, wherein:
X is an amino acid sequence of a Gram positive bacteria AI sequence or a fragment thereof; L is an
optional linker amino acid sequence; A is an optional N-terminal amino acid sequence; B is an
optional C-terminal amino acid sequence; and n is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 or 15.

If a -X- moiety has a leader peptide sequence in its wild-type form, this may be included or
omitted in the hybrid protein. In some embodiments, the leader peptides will be deleted except for
that of the -X- moiety located at the N-terminus of the hybrid protein, i.e. the leader peptide of X1 will
be retained, but the leader peptides of X2 ... Xn will be omitted. This is equivalent to deleting all
leader peptides and using the leader peptide of X1 as moiety -A-.

For each of the n instances of {-X-L-}, linker amino acid sequence -L- may be present or absent.
For instance, when n=2 the hybrid may be NH2-X1-L1-X2-L2-COOH, NH2-X1-X2-COOH,
NH2-X1-L1-X2-COOH, NH2-X1-X2-L2-COOH, etc. Linker amino acid sequence(s) -L- will typically
be short (e.g. 20 or fewer amino acids i.e. 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1).
Examples comprise short peptide sequences which facilitate cloning, poly-glycine linkers (i.e.
comprising (Gly)n where n = 2, 3, 4, 5, 6, 7, 8, 9, 10 or more), and histidine tags (i.e. (His)n where n = 3,
4, 5, 6, 7, 8, 9, 10 or more). Other suitable linker amino acid sequences will be apparent to those
skilled in the art. A useful linker is GSGGGG, with the Gly-Ser dipeptide being formed from a
BamHI restriction site, thus aiding cloning and manipulation, and the (Gly)4 tetrapeptide being a
typical poly-glycine linker.
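
Because each -L- may independently be present or absent, a hybrid of n moieties admits 2^n linker arrangements; the four listed for n=2 are the complete set. As an illustration only, the following Python sketch enumerates these arrangements as schematic labels. The function name linker_arrangements is hypothetical, and the optional -A- and -B- sequences are left out, as they are in the n=2 examples in the text.

```python
from itertools import product
from typing import List


def linker_arrangements(n: int) -> List[str]:
    """List every present/absent combination of linkers L1..Ln as a schematic label."""
    if not 2 <= n <= 15:
        raise ValueError("n must be between 2 and 15")
    labels = []
    # Each tuple holds one True/False choice per {-X-L-} repeat.
    for present in product([True, False], repeat=n):
        parts = ["NH2"]
        for i, has_linker in enumerate(present, start=1):
            parts.append(f"X{i}")
            if has_linker:
                parts.append(f"L{i}")
        parts.append("COOH")
        labels.append("-".join(parts))
    return labels


if __name__ == "__main__":
    for label in linker_arrangements(2):
        print(label)
```

The count doubles with each additional moiety (8 arrangements for n=3, 16 for n=4), which is one practical reason the small values of n are the easiest to explore exhaustively.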

-A- is an optional N-terminal amino acid sequence. This will typically be short (e.g. 40 or
fewer amino acids i.e. 39, 38, 37, 36, 35, 34, 33, 32, 31, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 19,
18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1). Examples include leader sequences to direct
protein trafficking, or short peptide sequences which facilitate cloning or purification (e.g. histidine
tags i.e. (His)n where n = 3, 4, 5, 6, 7, 8, 9, 10 or more). Other suitable N-terminal amino acid
sequences will be apparent to those skilled in the art. If X1 lacks its own N-terminus methionine, -A-
is preferably an oligopeptide (e.g. with 1, 2, 3, 4, 5, 6, 7 or 8 amino acids) which provides an
N-terminus methionine.

-B- is an optional C-terminal amino acid sequence. This will typically be short (e.g. 40 or
fewer amino acids i.e. 39, 38, 37, 36, 35, 34, 33, 32, 31, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 19,
18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1). Examples include sequences to direct
protein trafficking, short peptide sequences which facilitate cloning or purification (e.g. comprising
histidine tags i.e. (His)n where n = 3, 4, 5, 6, 7, 8, 9, 10 or more), or sequences which enhance protein
stability. Other suitable C-terminal amino acid sequences will be apparent to those skilled in the art.

Most preferably, n is 2 or 3. 
Antibodies: GBS AI sequences 

The GBS AI proteins of the invention may also be used to prepare antibodies specific to the
GBS AI proteins. The antibodies are preferably specific to an oligomeric or hyper-oligomeric
form of an AI protein. The invention also includes combinations of antibodies specific to GBS AI
proteins selected to provide protection against an increased range of GBS serotypes and strain
isolates. For example, a combination may comprise a first and second antibody, wherein said first
antibody is specific to a first GBS AI protein and said second antibody is specific to a second GBS AI
protein. Preferably, the nucleic acid sequence encoding said first GBS AI protein is not present in a
GBS genome comprising a polynucleotide sequence encoding for said second GBS AI protein.
Preferably, the nucleic acid sequences encoding said first and second GBS AI proteins are present in
the genomes of multiple GBS serotypes and strain isolates.

The GBS specific antibodies of the invention include one or more biological moieties that,
through chemical or physical means, can bind to or associate with an epitope of a GBS polypeptide.
The antibodies of the invention include antibodies which specifically bind to a GBS AI protein. The
invention includes antibodies obtained from both polyclonal and monoclonal preparations, as well as
the following: hybrid (chimeric) antibody molecules (see, for example, Winter et al. (1991) Nature
349:293-299; and US Patent No. 4,816,567); F(ab')2 and F(ab) fragments; Fv molecules (non-covalent
heterodimers, see, for example, Inbar et al. (1972) Proc Natl Acad Sci USA 69:2659-2662; and
Ehrlich et al. (1980) Biochem 19:4091-4096); single-chain Fv molecules (sFv) (see, for example,
Huston et al. (1988) Proc Natl Acad Sci USA 85:5879-5883); dimeric and trimeric antibody fragment
constructs; minibodies (see, e.g., Pack et al. (1992) Biochem 31:1579-1584; Cumber et al. (1992) J
Immunology 149B:120-126); humanized antibody molecules (see, for example, Riechmann et al.
(1988) Nature 332:323-327; Verhoeyen et al. (1988) Science 239:1534-1536; and U.K. Patent
Publication No. GB 2,276,169, published 21 September 1994); and any functional fragments
obtained from such molecules, wherein such fragments retain immunological binding properties of the
parent antibody molecule. The invention further includes antibodies obtained through
non-conventional processes, such as phage display.

Preferably, the GBS specific antibodies of the invention are monoclonal antibodies.
Monoclonal antibodies of the invention include an antibody composition having a homogeneous
antibody population. Monoclonal antibodies of the invention may be obtained from murine
hybridomas, or may be human monoclonal antibodies obtained using human rather than murine
hybridomas. See, e.g., Cote, et al. Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, 1985, p
77.

The antibodies of the invention may be used in diagnostic applications, for example, to detect
the presence or absence of GBS in a biological sample. The antibodies of the invention may also be
used in the prophylactic or therapeutic treatment of GBS infection.
Antibodies: Gram positive bacteria AI sequences 

The Gram positive bacteria AI proteins of the invention may also be used to prepare
antibodies specific to the Gram positive bacteria AI proteins. The antibodies are preferably specific to
an oligomeric or hyper-oligomeric form of an AI protein. The invention also includes
combinations of antibodies specific to Gram positive bacteria AI proteins selected to provide
protection against an increased range of Gram positive bacteria genus, species, serotypes and strain
isolates.

For example, a combination may comprise a first and second antibody, wherein said first
antibody is specific to a first Gram positive bacteria AI protein and said second antibody is specific to
a second Gram positive bacteria AI protein. Preferably, the nucleic acid sequence encoding said first
Gram positive bacteria AI protein is not present in a Gram positive bacterial genome comprising a
polynucleotide sequence encoding for said second Gram positive bacteria AI protein. Preferably, the
nucleic acid sequences encoding said first and second Gram positive bacteria AI proteins are present in
the genomes of multiple Gram positive bacteria genus, species, serotypes or strain isolates.

As an example of an instance where the combination of antibodies provides protection against 
an increased range of bacteria serotypes, the first antibody may be specific to a first GAS AI protein 
and the second antibody may be specific to a second GAS AI protein. The first GAS AI protein may 
comprise a GAS AI-1 surface protein, while the second GAS AI protein may comprise a GAS AI-2 or 
AI-3 surface protein. 

As an example of an instance where the combination of antibodies provides protection against 
an increased range of bacterial species, the first antibody may be specific to a GBS AI protein and the 
second antibody may be specific to a GAS AI protein. Alternatively, the first antibody may be
specific to a GAS AI protein and the second antibody may be specific to a S. pneumoniae AI protein.

The Gram positive specific antibodies of the invention include one or more biological
moieties that, through chemical or physical means, can bind to or associate with an epitope of a Gram
positive bacteria AI polypeptide. The antibodies of the invention include antibodies which
specifically bind to a Gram positive bacteria AI protein. The invention includes antibodies obtained
from both polyclonal and monoclonal preparations, as well as the following: hybrid (chimeric)
antibody molecules (see, for example, Winter et al. (1991) Nature 349:293-299; and US Patent No.
4,816,567); F(ab')2 and F(ab) fragments; Fv molecules (non-covalent heterodimers, see, for example,
Inbar et al. (1972) Proc Natl Acad Sci USA 69:2659-2662; and Ehrlich et al. (1980) Biochem
19:4091-4096); single-chain Fv molecules (sFv) (see, for example, Huston et al. (1988) Proc Natl
Acad Sci USA 85:5879-5883); dimeric and trimeric antibody fragment constructs; minibodies (see,
e.g., Pack et al. (1992) Biochem 31:1579-1584; Cumber et al. (1992) J Immunology 149B:120-126);
humanized antibody molecules (see, for example, Riechmann et al. (1988) Nature 332:323-327;
Verhoeyen et al. (1988) Science 239:1534-1536; and U.K. Patent Publication No. GB 2,276,169,
published 21 September 1994); and any functional fragments obtained from such molecules, wherein
such fragments retain immunological binding properties of the parent antibody molecule. The
invention further includes antibodies obtained through non-conventional processes, such as phage
display.

Preferably, the Gram positive specific antibodies of the invention are monoclonal antibodies.
Monoclonal antibodies of the invention include an antibody composition having a homogeneous
antibody population. Monoclonal antibodies of the invention may be obtained from murine
hybridomas, or may be human monoclonal antibodies obtained using human rather than murine
hybridomas. See, e.g., Cote, et al. Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, 1985, p
77.

The antibodies of the invention may be used in diagnostic applications, for example, to detect
the presence or absence of Gram positive bacteria in a biological sample. The antibodies of the
invention may also be used in the prophylactic or therapeutic treatment of Gram positive bacteria
infection.
Nucleic Acids 

The invention provides nucleic acids encoding the Gram positive bacteria sequences and/or
the hybrid fusion polypeptides of the invention. The invention also provides nucleic acid encoding
the GBS antigens and/or the hybrid fusion polypeptides of the invention. Furthermore, the invention
provides nucleic acid which can hybridise to these nucleic acids, preferably under "high stringency"
conditions (e.g. 65°C in a 0.1xSSC, 0.5% SDS solution).

Polypeptides of the invention can be prepared by various means (e.g. recombinant expression,
purification from cell culture, chemical synthesis, etc.) and in various forms (e.g. native, fusions,
non-glycosylated, lipidated, etc.). They are preferably prepared in substantially pure form (i.e.
substantially free from other GAS or host cell proteins).

Nucleic acid according to the invention can be prepared in many ways (e.g. by chemical
synthesis, from genomic or cDNA libraries, from the organism itself, etc.) and can take various forms
(e.g. single stranded, double stranded, vectors, probes, etc.). They are preferably prepared in
substantially pure form (i.e. substantially free from other GBS or host cell nucleic acids).

The term "nucleic acid" includes DNA and RNA, and also their analogues, such as those
containing modified backbones (e.g. phosphorothioates, etc.), and also peptide nucleic acids (PNA),
etc. The invention includes nucleic acid comprising sequences complementary to those described
above (e.g. for antisense or probing purposes).
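
As an illustration of what "sequences complementary to those described above" means in practice, the following minimal Python sketch computes the reverse complement of a DNA sequence, which is the strand that would pair with it as an antisense molecule or probe. The function name and the example sequence are hypothetical and are not taken from the specification; modified backbones and PNA are not modelled.

```python
# Minimal sketch (illustrative only): reverse complement of a DNA sequence.
# The function name and example input are assumptions, not part of the specification.

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    unexpected = set(seq.upper()) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected characters in sequence: {sorted(unexpected)}")
    # Complement each base, then reverse so the result also reads 5' to 3'.
    return seq.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    print(reverse_complement("ATGC"))
```

Applying the function twice returns the original sequence, which is a quick sanity check for any probe design script built on it.
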
The invention also provides a process for producing a polypeptide of the invention,
comprising the step of culturing a host cell transformed with nucleic acid of the invention under
conditions which induce polypeptide expression.

The invention provides a process for producing a polypeptide of the invention, comprising the
step of synthesising at least part of the polypeptide by chemical means.

The invention provides a process for producing nucleic acid of the invention, comprising the
step of amplifying nucleic acid using a primer-based amplification method (e.g. PCR).

The invention provides a process for producing nucleic acid of the invention, comprising the
step of synthesising at least part of the nucleic acid by chemical means.

Purification and Recombinant Expression 
The Gram positive bacteria AI proteins of the invention may be isolated from the native Gram
positive bacteria, or they may be recombinantly produced, for instance in a heterologous host. For
example, the GAS, GBS, and S. pneumoniae antigens of the invention may be isolated from
Streptococcus agalactiae, S. pyogenes, or S. pneumoniae, or they may be recombinantly produced, for
instance, in a heterologous host. Preferably, the GBS antigens are prepared using a heterologous host.

The heterologous host may be prokaryotic (e.g. a bacterium) or eukaryotic. It is preferably
E. coli, but other suitable hosts include Bacillus subtilis, Vibrio cholerae, Salmonella typhi, Salmonella
typhimurium, Neisseria lactamica, Neisseria cinerea, Mycobacteria (e.g. M. tuberculosis), S. gordonii,
L. lactis, yeasts, etc.

Recombinant production of polypeptides is facilitated by adding a tag protein to the Gram 
positive bacteria AI sequence to be expressed as a fusion protein comprising the tag protein and the 
Gram positive bacteria antigen. For example, recombinant production of polypeptides is facilitated by 
adding a tag protein to the GBS antigen to be expressed as a fusion protein comprising the tag protein 
and the GBS antigen. Such tag proteins can facilitate purification, detection and stability of the 
expressed protein. Tag proteins suitable for use in the invention include a polyarginine tag (Arg-tag), 
polyhistidine tag (His-tag), FLAG-tag, Strep-tag, c-myc-tag, S-tag, calmodulin-binding peptide, 
cellulose-binding domain, SBP-tag, chitin-binding domain, glutathione S-transferase-tag (GST),
maltose-binding protein, transcription termination anti-termination factor (NusA), E. coli
thioredoxin (TrxA) and protein disulfide isomerase I (DsbA). Preferred tag proteins include His-tag
and GST. A full discussion on the use of tag proteins can be found at Terpe et al., "Overview of tag
protein fusions: from molecular and biochemical fundamentals to commercial systems", Appl
Microbiol Biotechnol (2003) 60:523-533.

After purification, the tag proteins may optionally be removed from the expressed fusion
protein, e.g., by specifically tailored enzymatic treatments known in the art. Commonly used
proteases include enterokinase, tobacco etch virus (TEV) protease, thrombin, and factor Xa.
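
The tag-then-cleave workflow described above can be sketched at the sequence level. The following Python example is a hedged illustration only: it assumes an N-terminal hexahistidine tag followed by the commonly used TEV protease recognition sequence ENLYFQ/G (cleavage between Q and G), and it uses the first ten residues of SEQ ID NO: 80 purely as a stand-in antigen. The construct layout and the function names are assumptions and are not taken from the specification.

```python
# Illustrative sketch only: a His-tagged fusion with a TEV site, and its cleavage.
# Construct layout, names, and the truncated antigen are assumptions.

HIS_TAG = "HHHHHH"          # hexahistidine tag
TEV_RECOGNITION = "ENLYFQ"  # TEV protease cuts after the Q, before G or S


def build_fusion(antigen: str) -> str:
    """Assemble Met + His-tag + TEV site (with trailing G) + antigen."""
    return "M" + HIS_TAG + TEV_RECOGNITION + "G" + antigen


def cleave_tev(fusion: str) -> tuple[str, str]:
    """Split a fusion at the first TEV site; returns (tag fragment, released protein)."""
    idx = fusion.find(TEV_RECOGNITION)
    if idx == -1:
        raise ValueError("no TEV recognition sequence found")
    cut = idx + len(TEV_RECOGNITION)
    if cut >= len(fusion) or fusion[cut] not in "GS":
        raise ValueError("TEV site is not followed by G or S")
    # The G/S residue stays on the released protein after cleavage.
    return fusion[:cut], fusion[cut:]


if __name__ == "__main__":
    antigen = "MSRTKLRALL"  # first ten residues of SEQ ID NO: 80, for illustration
    fusion = build_fusion(antigen)
    tag_fragment, released = cleave_tev(fusion)
    print(fusion)
    print(tag_fragment, released)
```

Note that in this layout one extra residue (G) remains on the released protein, which is one reason the choice of protease and site placement matters when the native N-terminus is required.
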
GBS polysaccharides 

The compositions of the invention may be further improved by including GBS 
polysaccharides. Preferably, the GBS antigen and the saccharide each contribute to the immunological 
response in a recipient. The combination is particularly advantageous where the saccharide and 
polypeptide provide protection from different GBS serotypes. 

The combined antigens may be present as a simple combination where separate saccharide 
and polypeptide antigens are administered together, or they may be present as a conjugated 
combination, where the saccharide and polypeptide antigens are covalently linked to each other. 

Thus the invention provides an immunogenic composition comprising (i) one or more GBS 
AI proteins and (ii) one or more GBS saccharide antigens. The polypeptide and the polysaccharide 
may advantageously be covalently linked to each other to form a conjugate. 

Between them, the combined polypeptide and saccharide antigens preferably cover (or
provide protection from) two or more GBS serotypes (e.g. 2, 3, 4, 5, 6, 7, 8 or more serotypes). The
serotypes of the polypeptide and saccharide antigens may or may not overlap. For example, the
polypeptide might protect against serogroup II or V, while the saccharide protects against
serogroup Ia, Ib, or III. Preferred combinations protect against the following groups of serotypes:
(1) serotypes Ia and Ib, (2) serotypes Ia and II, (3) serotypes Ia and III, (4) serotypes Ia and IV, (5)
serotypes Ia and V, (6) serotypes Ia and VI, (7) serotypes Ia and VII, (8) serotypes Ia and VIII, (9)
serotypes Ib and II, (10) serotypes Ib and III, (11) serotypes Ib and IV, (12) serotypes Ib and V, (13)
serotypes Ib and VI, (14) serotypes Ib and VII, (15) serotypes Ib and VIII, (16) serotypes II and III,
(17) serotypes II and IV, (18) serotypes II and V, (19) serotypes II and VI, (20) serotypes II and VII,
(21) serotypes II and VIII, (22) serotypes III and IV, (23) serotypes III and V, (24) serotypes III and
VI, (25) serotypes III and VII, (26) serotypes III and VIII, (27) serotypes IV and V, (28) serotypes IV
and VI, (29) serotypes IV and VII, (30) serotypes IV and VIII, (31) serotypes V and VI, (32)
serotypes V and VII, (33) serotypes V and VIII, (34) serotypes VI and VII, (35) serotypes VI and
VIII, and (36) serotypes VII and VIII.
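
The 36 numbered groups above appear to correspond to every unordered pair that can be drawn from nine GBS serotypes (Ia, Ib, II, III, IV, V, VI, VII and VIII). The short Python sketch below regenerates the numbering under that assumption; the serotype ordering is inferred from the list itself, and the variable names are illustrative.

```python
# Sketch: enumerate all unordered pairs of nine GBS serotypes.
# Assumption: the numbered list is the set of 2-element combinations in this order.
from itertools import combinations

SEROTYPES = ["Ia", "Ib", "II", "III", "IV", "V", "VI", "VII", "VIII"]

# combinations() yields pairs in lexicographic order of the input positions.
PAIRS = list(combinations(SEROTYPES, 2))

if __name__ == "__main__":
    for number, (first, second) in enumerate(PAIRS, start=1):
        print(f"({number}) serotypes {first} and {second}")
```

Nine serotypes give 9 x 8 / 2 = 36 pairs, which matches the final item number in the list.
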

Still more preferably, the combinations protect against the following groups of serotypes: (1)
serotypes Ia and II, (2) serotypes Ia and V, (3) serotypes Ib and II, (4) serotypes Ib and V, (5)
serotypes III and II, and (6) serotypes III and V. Most preferably, the combinations protect against
serotypes III and V.

Protection against serotypes II and V is preferably provided by polypeptide antigens.

Protection against serotypes Ia, Ib and/or III may be provided by polypeptide or saccharide antigens.

Immunogenic compositions and medicaments 

Compositions of the invention are preferably immunogenic compositions, and are more
preferably vaccine compositions. The pH of the composition is preferably between 6 and 8,
preferably about 7. The pH may be maintained by the use of a buffer. The composition may be
sterile and/or pyrogen-free. The composition may be isotonic with respect to humans.

Vaccines according to the invention may either be prophylactic (i.e. to prevent infection) or
therapeutic (i.e. to treat infection), but will typically be prophylactic. Accordingly, the invention
includes a method for the therapeutic or prophylactic treatment of a Gram positive bacteria infection
in an animal susceptible to such Gram positive bacterial infection comprising administering to said
animal a therapeutic or prophylactic amount of the immunogenic composition of the invention. For
example, the invention includes a method for the therapeutic or prophylactic treatment of a
Streptococcus agalactiae, S. pyogenes, or S. pneumoniae infection in an animal susceptible to
streptococcal infection comprising administering to said animal a therapeutic or prophylactic amount
of the immunogenic compositions of the invention.

The invention also provides the compositions described herein for use as a medicament.
The medicament is preferably able to raise an immune response in
a mammal (i.e. it is an immunogenic composition) and is more preferably a vaccine.

The invention also provides the use of the compositions of the invention in the manufacture of
a medicament for raising an immune response in a mammal. The medicament is preferably a vaccine.

The invention also provides kits comprising one or more containers of compositions of the
invention. Compositions can be in liquid form or can be lyophilized, as can individual antigens.
Suitable containers for the compositions include, for example, bottles, vials, syringes, and test tubes.
The container may have a sterile access port (for example, the container may be an intravenous
solution bag or a vial having a stopper pierceable by a hypodermic injection needle). The
composition may comprise a first
component comprising one or more Gram positive bacteria AI proteins. Preferably, the AI proteins
are surface AI proteins. Preferably, the AI surface proteins are in an oligomeric or hyperoligomeric
form. For example, the first component comprises a combination of GBS antigens or GAS antigens,
or S. pneumoniae antigens. Preferably said combination includes GBS 80. Preferably GBS 80 is
present in an oligomeric or hyperoligomeric form.

The kit can further comprise a second container comprising a pharmaceutically-acceptable
buffer, such as phosphate-buffered saline, Ringer's solution, or dextrose solution. It can also contain
other materials useful to the end-user, including other buffers, diluents, filters, needles, and syringes.
The kit can also comprise a second or third container with another active agent, for example an
antibiotic.

The kit can also comprise a package insert containing written instructions for methods of
inducing immunity against S. agalactiae and/or S. pyogenes and/or S. pneumoniae or for treating S.
agalactiae and/or S. pyogenes and/or S. pneumoniae infections. The package insert can be an
unapproved draft package insert or can be a package insert approved by the Food and Drug
Administration (FDA) or other regulatory body.

The invention also provides a delivery device pre-filled with the immunogenic compositions
of the invention.

The invention also provides a method for raising an immune response in a mammal
comprising the step of administering an effective amount of a composition of the invention. The
immune response is preferably protective and preferably involves antibodies and/or cell-mediated
immunity. This immune response will preferably induce long lasting (e.g., neutralising) antibodies
and a cell-mediated immunity that can quickly respond upon exposure to one or more GBS and/or
GAS and/or S. pneumoniae antigens. The method may raise a booster response.

The invention provides a method of neutralizing GBS, GAS, or S. pneumoniae infection in a
mammal comprising the step of administering to the mammal an effective amount of the
immunogenic compositions of the invention, a vaccine of the invention, or antibodies which recognize
an immunogenic composition of the invention.

The mammal is preferably a human. Where the vaccine is for prophylactic use, the human is
preferably a female (either of child bearing age or a teenager). Alternatively, the human may be
elderly (e.g., over the age of 50, 55, 60, 65, 70 or 75) and may have an underlying disease such as
diabetes or cancer. Where the vaccine is for therapeutic use, the human is preferably a pregnant
female or an elderly adult.

These uses and methods are preferably for the prevention and/or treatment of a disease caused
by Streptococcus agalactiae, or S. pyogenes, or S. pneumoniae. The compositions may also be
effective against other Gram positive bacteria.

One way of checking efficacy of therapeutic treatment involves monitoring Gram positive
bacterial infection after administration of the composition of the invention. One way of checking
efficacy of prophylactic treatment involves monitoring immune responses against the Gram positive
bacterial antigens in the compositions of the invention after administration of the composition.

One way of checking efficacy of therapeutic treatment involves monitoring GBS infection
after administration of the composition of the invention. One way of checking efficacy of
prophylactic treatment involves monitoring immune responses against the GBS antigens in the
compositions of the invention after administration of the composition.

A way of assessing the immunogenicity of the component proteins of the immunogenic
compositions of the present invention is to express the proteins recombinantly and to screen patient
sera or mucosal secretions by immunoblot. A positive reaction between the protein and the patient
serum indicates that the patient has previously mounted an immune response to the protein in
question, that is, the protein is an immunogen. This method may also be used to identify
immunodominant proteins and/or epitopes.

Another way of checking efficacy of therapeutic treatment involves monitoring GBS or GAS
or S. pneumoniae infection after administration of the compositions of the invention. One way of
checking efficacy of prophylactic treatment involves monitoring immune responses both systemically
(such as monitoring the level of IgG1 and IgG2a production) and mucosally (such as monitoring the
level of IgA production) against the GBS and/or GAS and/or S. pneumoniae antigens in the
compositions of the invention after administration of the composition. Typically, GBS and/or GAS
and/or S. pneumoniae serum specific antibody responses are determined post-immunization but
pre-challenge whereas mucosal GBS and/or GAS and/or S. pneumoniae specific antibody responses
are determined post-immunization and post-challenge.

The vaccine compositions of the present invention can be evaluated in in vitro and in vivo 
animal models prior to host, e.g., human, administration. 

The efficacy of immunogenic compositions of the invention can also be determined in vivo by
challenging animal models of GBS and/or GAS and/or S. pneumoniae infections, e.g., guinea pigs or
mice, with the immunogenic compositions. The immunogenic compositions may or may not be
derived from the same serotypes as the challenge serotypes. Preferably the immunogenic
compositions are derivable from the same serotypes as the challenge serotypes. More preferably, the
immunogenic composition and/or the challenge serotypes are derivable from the group of GBS and/or
GAS and/or S. pneumoniae serotypes.

In vivo efficacy models include but are not limited to: (i) a murine infection model using
human GBS and/or GAS and/or S. pneumoniae serotypes; (ii) a murine disease model which is a
murine model using a mouse-adapted GBS and/or GAS and/or S. pneumoniae strain, such as
strains that are virulent in mice; and (iii) a primate model using human
GBS or GAS or S. pneumoniae isolates.

The immune response may be one or both of a TH1 immune response and a TH2 response. 

The immune response may be an improved or an enhanced or an altered immune response. 

The immune response may be one or both of a systemic and a mucosal immune response. 

Preferably the immune response is an enhanced systemic and/or mucosal response.

An enhanced systemic and/or mucosal immunity is reflected in an enhanced TH1 and/or TH2
immune response. Preferably, the enhanced immune response includes an increase in the production
of IgG1 and/or IgG2a and/or IgA.

Preferably the mucosal immune response is a TH2 immune response. Preferably, the mucosal 
immune response includes an increase in the production of IgA. 

Activated TH2 cells enhance antibody production and are therefore of value in responding to
extracellular infections. Activated TH2 cells may secrete one or more of IL-4, IL-5, IL-6, and IL-10.
A TH2 immune response may result in the production of IgG1, IgE, IgA and memory B cells for
future protection.

A TH2 immune response may include one or more of an increase in one or more of the
cytokines associated with a TH2 immune response (such as IL-4, IL-5, IL-6 and IL-10), or an increase
in the production of IgG1, IgE, IgA and memory B cells. Preferably, the enhanced TH2 immune
response will include an increase in IgG1 production.

A TH1 immune response may include one or more of an increase in CTLs, an increase in one 
or more of the cytokines associated with a TH1 immune response (such as IL-2, IFNγ, and TNFβ), an
increase in activated macrophages, an increase in NK activity, or an increase in the production of 
IgG2a. Preferably, the enhanced TH1 immune response will include an increase in IgG2a production. 

Immunogenic compositions of the invention, in particular, immunogenic compositions
comprising one or more GAS antigens of the present invention, may be used either alone or in
combination with other GAS antigens, optionally with an immunoregulatory agent capable of eliciting
a Th1 and/or Th2 response.

Compositions of the invention will generally be administered directly to a patient. Certain 
routes may be favored for certain compositions, as resulting in the generation of a more effective
immune response, preferably a CMI response, or as being less likely to induce side effects, or as being 
easier for administration. Direct delivery may be accomplished by parenteral injection (e.g. 
subcutaneously, intraperitoneally, intradermally, intravenously, intramuscularly, or to the interstitial 
space of a tissue), or by rectal, oral (e.g. tablet, spray), vaginal, topical, transdermal (e.g. see WO 
99/27961) or transcutaneous (e.g. see WO 02/074244 and WO 02/064162), intranasal (e.g. see 
WO03/028760), ocular, aural, pulmonary or other mucosal administration. 

The invention may be used to elicit systemic and/or mucosal immunity.

In one particularly preferred embodiment, the immunogenic composition comprises one or
more GBS or GAS or S. pneumoniae antigen(s) which elicit a neutralising antibody response and one
or more GBS or GAS or S. pneumoniae antigen(s) which elicit a cell mediated immune response. In
this way, the neutralising antibody response prevents or inhibits an initial GBS or GAS or S.
pneumoniae infection while the cell-mediated immune response capable of eliciting an enhanced Th1
cellular response prevents further spreading of the GBS or GAS or S. pneumoniae infection.
Preferably, the immunogenic composition comprises one or more GBS or GAS or S. pneumoniae
surface antigens and one or more GBS or GAS or S. pneumoniae cytoplasmic antigens. Preferably the
immunogenic composition comprises one or more GBS or GAS or S. pneumoniae surface antigens or
the like and one or more other antigens, such as a cytoplasmic antigen capable of eliciting a Th1 cellular
response.

Dosage treatment can be a single dose schedule or a multiple dose schedule. Multiple doses
may be used in a primary immunisation schedule and/or in a booster immunisation schedule. In a
multiple dose schedule the various doses may be given by the same or different routes, e.g. a
parenteral prime and mucosal boost, a mucosal prime and parenteral boost, etc.

The compositions of the invention may be prepared in various forms. For example, the
compositions may be prepared as injectables, either as liquid solutions or suspensions. Solid forms
suitable for solution in, or suspension in, liquid vehicles prior to injection can also be prepared (e.g. a
lyophilised composition). The composition may be prepared for topical administration e.g. as an
ointment, cream or powder. The composition may be prepared for oral administration e.g. as a tablet
or capsule, as a spray, or as a syrup (optionally flavoured). The composition may be prepared for
pulmonary administration e.g. as an inhaler, using a fine powder or a spray. The composition may be
prepared as a suppository or pessary. The composition may be prepared for nasal, aural or ocular
administration e.g. as drops. The composition may be in kit form, designed such that a combined
composition is reconstituted just prior to administration to a patient. Such kits may comprise one or
more antigens in liquid form and one or more lyophilised antigens.

Immunogenic compositions used as vaccines comprise an immunologically effective amount
of antigen(s), as well as any other components, such as antibiotics, as needed. By 'immunologically
effective amount', it is meant that the administration of that amount to an individual, either in a single
dose or as part of a series, is effective for treatment or prevention, or increases a measurable immune
response or prevents or reduces a clinical symptom. This amount varies depending upon the health
and physical condition of the individual to be treated, age, the taxonomic group of individual to be
treated (e.g. non-human primate, primate, etc.), the capacity of the individual's immune system to
synthesise antibodies, the degree of protection desired, the formulation of the vaccine, the treating
doctor's assessment of the medical situation, and other relevant factors. It is expected that the amount
will fall in a relatively broad range that can be determined through routine trials.

The composition of the invention will typically, in addition to the components mentioned
above, comprise one or more 'pharmaceutically acceptable carriers', which include any carrier that
does not itself induce the production of antibodies harmful to the individual receiving the
composition. Suitable carriers are typically large, slowly metabolised macromolecules such as
proteins, polysaccharides, polylactic acids, polyglycolic acids, polymeric amino acids, amino acid
copolymers, and lipid aggregates (such as oil droplets or liposomes). Such carriers are well known to
those of ordinary skill in the art. The vaccines may also contain diluents, such as water, saline,
glycerol, etc. Additionally, auxiliary substances, such as wetting or emulsifying agents, pH buffering
substances, and the like, may be present. A thorough discussion of pharmaceutically acceptable
excipients is available in Gennaro (2000) Remington: The Science and Practice of Pharmacy. 20th
ed., ISBN: 0683306472.

Adjuvants 

Vaccines of the invention may be administered in conjunction with other immunoregulatory 
agents. In particular, compositions will usually include an adjuvant. Adjuvants for use with the 
invention include, but are not limited to, one or more of the following set forth below: 

A. Mineral Containing Compositions 

Mineral containing compositions suitable for use as adjuvants in the invention include 
mineral salts, such as aluminum salts and calcium salts. The invention includes mineral salts such as 
hydroxides (e.g. oxyhydroxides), phosphates (e.g. hydroxyphosphates, orthophosphates), sulfates, etc.
(e.g. see chapters 8 & 9 of Vaccine Design... (1995) eds. Powell & Newman. ISBN: 030644867X.
Plenum.), or mixtures of different mineral compounds (e.g. a mixture of a phosphate and a hydroxide
adjuvant, optionally with an excess of the phosphate), with the compounds taking any suitable form
(e.g. gel, crystalline, amorphous, etc.), and with adsorption to the salt(s) being preferred. The mineral
containing compositions may also be formulated as a particle of metal salt (WO 00/23105).

Aluminum salts may be included in vaccines of the invention such that the dose of Al³⁺ is
between 0.2 and 1.0 mg per dose.
B. Oil-Emulsions 

Oil-emulsion compositions suitable for use as adjuvants in the invention include squalene-water
emulsions, such as MF59 (5% Squalene, 0.5% Tween 80, and 0.5% Span 85, formulated into
submicron particles using a microfluidizer). See WO90/14837. See also, Podda, "The adjuvanted
influenza vaccines with novel adjuvants: experience with the MF59-adjuvanted vaccine", Vaccine
(2001) 19:2673-2680; Frey et al., "Comparison of the safety, tolerability, and immunogenicity of a
MF59-adjuvanted influenza vaccine and a non-adjuvanted influenza vaccine in non-elderly adults",
Vaccine (2003) 21:4234-4237. MF59 is used as the adjuvant in the FLUAD™ influenza virus
trivalent subunit vaccine.

Preferred submicron oil-in-water emulsions for use herein are squalene/water emulsions
optionally containing varying amounts of MTP-PE, such as a submicron oil-in-water emulsion
containing 4-5% w/v squalene, 0.25-1.0% w/v Tween 80™ (polyoxyethylenesorbitan monooleate),
and/or 0.25-1.0% Span 85™ (sorbitan trioleate), and, optionally,
N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1'-2'-dipalmitoyl-sn-glycero-3-hydroxyphosphoryloxy)-ethylamine
(MTP-PE), for example, the submicron oil-in-water emulsion known as "MF59" (International
Publication No. WO 90/14837; US Patent Nos. 6,299,884 and 6,451,325, incorporated herein by
reference in their entireties; and Ott et al., "MF59 - Design and Evaluation of a Safe and Potent
Adjuvant for Human Vaccines" in Vaccine Design: The Subunit and Adjuvant Approach (Powell,
M.F. and Newman, M.J. eds.) Plenum Press, New York, 1995, pp. 277-296). MF59 contains 4-5%
w/v Squalene (e.g. 4.3%), 0.25-0.5% w/v Tween 80™, and 0.5% w/v Span 85™ and optionally
contains various amounts of MTP-PE, formulated into submicron particles using a microfluidizer such
as Model 110Y microfluidizer (Microfluidics, Newton, MA). For example, MTP-PE may be present
in an amount of about 0-500 µg/dose, more preferably 0-250 µg/dose and most preferably, 0-100
µg/dose. As used herein, the term "MF59-0" refers to the above submicron oil-in-water emulsion
lacking MTP-PE, while the term MF59-MTP denotes a formulation that contains MTP-PE. For
instance, "MF59-100" contains 100 µg MTP-PE per dose, and so on. MF69, another submicron
oil-in-water emulsion for use herein, contains 4.3% w/v squalene, 0.25% w/v Tween 80™, and 0.75% w/v
Span 85™ and optionally MTP-PE. Yet another submicron oil-in-water emulsion is MF75, also
known as SAF, containing 10% squalene, 0.4% Tween 80™, 5% pluronic-blocked polymer L121, and
thr-MDP, also microfluidized into a submicron emulsion. MF75-MTP denotes an MF75 formulation
that includes MTP, such as from 100-400 µg MTP-PE per dose.
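
The w/v percentages quoted above can be converted into component masses for a given emulsion volume, since x% w/v means x g per 100 mL (that is, 10x mg per mL). The Python sketch below applies this to the MF69 figures stated in the text (4.3% squalene, 0.25% Tween 80, 0.75% Span 85). The 0.5 mL volume is a hypothetical value chosen for illustration and is not taken from the specification; the function and variable names are likewise illustrative.

```python
# Sketch: convert % w/v composition into mg of each component for a given volume.
# The 0.5 mL volume used below is a hypothetical example, not from the specification.

MF69_PERCENT_W_V = {"squalene": 4.3, "Tween 80": 0.25, "Span 85": 0.75}


def component_mass_mg(percent_w_v: float, volume_ml: float) -> float:
    """x% w/v is x g per 100 mL, which equals 10*x mg per mL."""
    return percent_w_v * 10.0 * volume_ml


def emulsion_masses_mg(formulation: dict[str, float], volume_ml: float) -> dict[str, float]:
    """Return the mass in mg of every component in the given emulsion volume."""
    return {name: component_mass_mg(pct, volume_ml) for name, pct in formulation.items()}


if __name__ == "__main__":
    for name, mass in emulsion_masses_mg(MF69_PERCENT_W_V, volume_ml=0.5).items():
        print(f"{name}: {mass:.2f} mg")
```

The same helper can be reused for the MF59 ranges by evaluating the lower and upper bound of each range separately.
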

Submicron oil-in-water emulsions, methods of making the same and immunostimulating
agents, such as muramyl peptides, for use in the compositions, are described in detail in International
Publication No. WO 90/14837 and US Patent Nos. 6,299,884 and 6,451,325, incorporated herein by
reference in their entireties.

Complete Freund's adjuvant (CFA) and incomplete Freund's adjuvant (IFA) may also be
used as adjuvants in the invention.

C. Saponin Formulations

Saponin formulations may also be used as adjuvants in the invention. Saponins are a
heterologous group of sterol glycosides and triterpenoid glycosides that are found in the bark, leaves,
stems, roots and even flowers of a wide range of plant species. Saponins from the bark of the Quillaia
saponaria Molina tree have been widely studied as adjuvants. Saponin can also be commercially
obtained from Smilax ornata (sarsaparilla), Gypsophila paniculata (brides veil), and Saponaria
officinalis (soap root). Saponin adjuvant formulations include purified formulations, such as QS21,
as well as lipid formulations, such as ISCOMs.

Saponin compositions have been purified using High Performance Thin Layer
Chromatography (HP-LC) and Reversed Phase High Performance Liquid Chromatography
(RP-HPLC). Specific purified fractions using these techniques have been identified, including QS7, QS17,
QS18, QS21, QH-A, QH-B and QH-C. Preferably, the saponin is QS21. A method of production of
QS21 is disclosed in US Patent No. 5,057,540. Saponin formulations may also comprise a sterol, such
as cholesterol (see WO96/33739).

Combinations of saponins and cholesterols can be used to form unique particles called
Immunostimulating Complexes (ISCOMs). ISCOMs typically also include a phospholipid such as
phosphatidylethanolamine or phosphatidylcholine. Any known saponin can be used in ISCOMs.
Preferably, the ISCOM includes one or more of Quil A, QHA and QHC. ISCOMs are further
described in EP0109942, WO 96/11711 and WO 96/33739. Optionally, the ISCOMs may be devoid
of additional detergent. See WO 00/07621.

A review of the development of saponin based adjuvants can be found at Barr, et al., 
"ISCOMs and other saponin based adjuvants", Advanced Drug Delivery Reviews (1998) 32:247-271. 
See also Sjolander, et al., "Uptake and adjuvant activity of orally delivered saponin and ISCOM 
vaccines", Advanced Drug Delivery Reviews (1998) 32:321-338. 

D. Virosomes and Virus Like Particles (VLPs) 

Virosomes and Virus Like Particles (VLPs) can also be used as adjuvants in the invention.
These structures generally contain one or more proteins from a virus optionally combined or
formulated with a phospholipid. They are generally non-pathogenic, non-replicating and generally do
not contain any of the native viral genome. The viral proteins may be recombinantly produced or
isolated from whole viruses. These viral proteins suitable for use in virosomes or VLPs include
proteins derived from influenza virus (such as HA or NA), Hepatitis B virus (such as core or capsid
proteins), Hepatitis E virus, measles virus, Sindbis virus, Rotavirus, Foot-and-Mouth Disease virus,
Retrovirus, Norwalk virus, human Papilloma virus, HIV, RNA-phages, Qβ-phage (such as coat
proteins), GA-phage, fr-phage, AP205 phage, and Ty (such as retrotransposon Ty protein p1). VLPs
are discussed further in WO 03/024480, WO 03/024481, and Niikura et al., "Chimeric Recombinant
Hepatitis E Virus-Like Particles as an Oral Vaccine Vehicle Presenting Foreign Epitopes", Virology
(2002) 293:273-280; Lenz et al., "Papillomavirus-Like Particles Induce Acute Activation of
Dendritic Cells", Journal of Immunology (2001) 5246-5355; Pinto, et al., "Cellular Immune
Responses to Human Papillomavirus (HPV)-16 L1 in Healthy Volunteers Immunized with Recombinant
HPV-16 L1 Virus-Like Particles", Journal of Infectious Diseases (2003) 188:327-338; and Gerber et
al., "Human Papillomavirus Virus-Like Particles Are Efficient Oral Immunogens when
Coadministered with Escherichia coli Heat-Labile Enterotoxin Mutant R192G or CpG", Journal of
Virology (2001) 75(10):4752-4760. Virosomes are discussed further in, for example, Gluck et al.,
"New Technology Platforms in the Development of Vaccines for the Future", Vaccine (2002)
20:B10-B16. Immunopotentiating reconstituted influenza virosomes (IRIV) are used as the subunit antigen
delivery system in the intranasal trivalent INFLEXAL™ product (Mischler & Metcalfe (2002)
Vaccine 20 Suppl 5:B17-23) and the INFLUVAC PLUS™ product.
E. Bacterial or Microbial Derivatives 

Adjuvants suitable for use in the invention include bacterial or microbial derivatives such as: 

(1) Non-toxic derivatives of enterobacterial lipopolysaccharide (LPS)

Such derivatives include Monophosphoryl lipid A (MPL) and 3-O-deacylated MPL (3dMPL).
3dMPL is a mixture of 3 De-O-acylated monophosphoryl lipid A with 4, 5 or 6 acylated chains. A
preferred "small particle" form of 3 De-O-acylated monophosphoryl lipid A is disclosed in EP 0 689
454. Such "small particles" of 3dMPL are small enough to be sterile filtered through a 0.22 micron
membrane (see EP 0 689 454). Other non-toxic LPS derivatives include monophosphoryl lipid A
mimics, such as aminoalkyl glucosaminide phosphate derivatives, e.g. RC-529. See Johnson et al.
(1999) Bioorg Med Chem Lett 9:2273-2278.

(2) Lipid A Derivatives 

Lipid A derivatives include derivatives of lipid A from Escherichia coli such as OM-174.
OM-174 is described, for example, in Meraldi et al., "OM-174, a New Adjuvant with a Potential for
Human Use, Induces a Protective Response when Administered with the Synthetic C-Terminal
Fragment 242-310 from the circumsporozoite protein of Plasmodium berghei", Vaccine (2003)
21:2485-2491; and Pajak, et al., "The Adjuvant OM-174 induces both the migration and maturation of
murine dendritic cells in vivo", Vaccine (2003) 21:836-842.

(3) Immunostimulatory oligonucleotides 

Immunostimulatory oligonucleotides suitable for use as adjuvants in the invention include 
nucleotide sequences containing a CpG motif (a sequence containing an unmethylated cytosine 
followed by guanosine and linked by a phosphate bond). Bacterial double stranded RNA or 
oligonucleotides containing palindromic or poly(dG) sequences have also been shown to be 
immunostimulatory. 

The CpGs can include nucleotide modifications/analogs such as phosphorothioate
modifications and can be double-stranded or single-stranded. Optionally, the guanosine may be
replaced with an analog such as 2'-deoxy-7-deazaguanosine. See Kandimalla, et al., "Divergent
synthetic nucleotide motif recognition pattern: design and development of potent immunomodulatory
oligodeoxyribonucleotide agents with distinct cytokine induction profiles", Nucleic Acids Research
(2003) 31(9):2393-2400; WO02/26757 and WO99/62923 for examples of possible analog
substitutions. The adjuvant effect of CpG oligonucleotides is further discussed in Krieg, "CpG motifs:
the active ingredient in bacterial extracts?", Nature Medicine (2003) 9(7):831-835; McCluskie, et al.,
"Parenteral and mucosal prime-boost immunization strategies in mice with hepatitis B surface antigen
and CpG DNA", FEMS Immunology and Medical Microbiology (2002) 32:179-185; WO98/40100;
US Patent No. 6,207,646; US Patent No. 6,239,116 and US Patent No. 6,429,199.

The CpG sequence may be directed to TLR9, such as the motif GTCGTT or TTCGTT. See
Kandimalla, et al., "Toll-like receptor 9: modulation of recognition and cytokine induction by novel
synthetic CpG DNAs", Biochemical Society Transactions (2003) 31 (part 3):654-658. The CpG
sequence may be specific for inducing a Th1 immune response, such as a CpG-A ODN, or it may be
more specific for inducing a B cell response, such as a CpG-B ODN. CpG-A and CpG-B ODNs are
discussed in Blackwell, et al., "CpG-A-Induced Monocyte IFN-gamma-Inducible Protein-10
Production is Regulated by Plasmacytoid Dendritic Cell Derived IFN-alpha", J. Immunol. (2003)
170(8):4061-4068; Krieg, "From A to Z on CpG", TRENDS in Immunology (2002) 23(2):64-65 and
WO01/95935. Preferably, the CpG is a CpG-A ODN.
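
As an illustration of the TLR9-directed motifs named above, the following minimal sketch scans an
oligonucleotide for the hexamers GTCGTT and TTCGTT. The function names and the test sequence are
hypothetical and are not taken from the specification. A plain sequence string cannot show whether
a cytosine is methylated, so the sketch only locates candidate motifs.

```python
import re

# Hexamer motifs given in the text as examples of TLR9-directed CpG sequences.
TLR9_MOTIFS = ("GTCGTT", "TTCGTT")


def find_cpg_motifs(oligo, motifs=TLR9_MOTIFS):
    """Return sorted (position, motif) pairs (0-based) for every motif occurrence."""
    seq = oligo.upper()
    hits = []
    for motif in motifs:
        # A lookahead is used so that overlapping occurrences are also reported.
        for match in re.finditer("(?=" + motif + ")", seq):
            hits.append((match.start(), motif))
    return sorted(hits)


def count_cpg_dinucleotides(oligo):
    """Count CG dinucleotides in the oligonucleotide (methylation state is unknown)."""
    return oligo.upper().count("CG")


if __name__ == "__main__":
    example = "TCGTCGTTTTGTCGTTTTGTCGTT"  # illustrative sequence only
    print(find_cpg_motifs(example))
    print(count_cpg_dinucleotides(example))
```

Such a scan could be used as a first screen when designing candidate oligonucleotides; it says
nothing about backbone chemistry or CpG-A versus CpG-B class.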

Preferably, the CpG oligonucleotide is constructed so that the 5' end is accessible for receptor
recognition. Optionally, two CpG oligonucleotide sequences may be attached at their 3' ends to form
"immunomers". See, for example, Kandimalla, et al., "Secondary structures in CpG oligonucleotides
affect immunostimulatory activity", BBRC (2003) 306:948-953; Kandimalla, et al., "Toll-like
receptor 9: modulation of recognition and cytokine induction by novel synthetic CpG DNAs",
Biochemical Society Transactions (2003) 31(part 3):654-658; Bhagat et al., "CpG penta- and
hexadeoxyribonucleotides as potent immunomodulatory agents", BBRC (2003) 300:853-861 and WO
03/035836.

(4) ADP-ribosylating toxins and detoxified derivatives thereof. 

Bacterial ADP-ribosylating toxins and detoxified derivatives thereof may be used as
adjuvants in the invention. Preferably, the protein is derived from E. coli (i.e., E. coli heat labile
enterotoxin "LT"), cholera ("CT"), or pertussis ("PT"). The use of detoxified ADP-ribosylating toxins
as mucosal adjuvants is described in WO95/17211 and as parenteral adjuvants in WO98/42375.
Preferably, the adjuvant is a detoxified LT mutant such as LT-K63, LT-R72, or LT-R192G. The use
of ADP-ribosylating toxins and detoxified derivatives thereof, particularly LT-K63 and LT-R72, as
adjuvants is described in the following references, each of which is specifically incorporated by
reference herein in its entirety: Beignon, et al., "The LTR72 Mutant of Heat-Labile Enterotoxin of
Escherichia coli Enhances the Ability of Peptide Antigens to Elicit CD4+ T Cells and Secrete Gamma
Interferon after Coapplication onto Bare Skin", Infection and Immunity (2002) 70(6):3012-3019;
Pizza, et al., "Mucosal vaccines: non toxic derivatives of LT and CT as mucosal adjuvants", Vaccine
(2001) 19:2534-2541; Pizza, et al., "LTK63 and LTR72, two mucosal adjuvants ready for clinical
trials", Int. J. Med. Microbiol (2000) 290(4-5):455-461; Scharton-Kersten et al., "Transcutaneous
Immunization with Bacterial ADP-Ribosylating Exotoxins, Subunits and Unrelated Adjuvants",
Infection and Immunity (2000) 68(9):5306-5313; Ryan et al., "Mutants of Escherichia coli Heat-
Labile Toxin Act as Effective Mucosal Adjuvants for Nasal Delivery of an Acellular Pertussis
Vaccine: Differential Effects of the Nontoxic AB Complex and Enzyme Activity on Th1 and Th2
Cells", Infection and Immunity (1999) 67(12):6270-6280; Partidos et al., "Heat-labile enterotoxin of
Escherichia coli and its site-directed mutant LTK63 enhance the proliferative and cytotoxic T-cell
responses to intranasally co-immunized synthetic peptides", Immunol. Lett. (1999) 67(3):209-216;
Peppoloni et al., "Mutants of the Escherichia coli heat-labile enterotoxin as safe and strong adjuvants
for intranasal delivery of vaccines", Vaccines (2003) 2(2):285-293; and Pine et al., "Intranasal
[...] detoxified mutant of heat labile enterotoxin from
Escherichia coli (LTK63)", J. Control Release (2002) 85(1-3):263-270. Numerical reference for amino
acid substitutions is preferably based on the alignments of the A and B subunits of ADP-ribosylating
toxins set forth in Domenighini et al., Mol. Microbiol (1995) 15(6):1165-1167, specifically
incorporated herein by reference in its entirety.

F. Bioadhesives and Mucoadhesives 

Bioadhesives and mucoadhesives may also be used as adjuvants in the invention. Suitable
bioadhesives include esterified hyaluronic acid microspheres (Singh et al. (2001) J. Cont. Rele.
70:267-276) or mucoadhesives such as cross-linked derivatives of poly(acrylic acid), polyvinyl
alcohol, polyvinyl pyrollidone, polysaccharides and carboxymethylcellulose. Chitosan and derivatives
thereof may also be used as adjuvants in the invention. See, e.g., WO99/27960.

G. Microparticles 

Microparticles may also be used as adjuvants in the invention. Microparticles (i.e. a particle
of ~100 nm to ~150 µm in diameter, more preferably ~200 nm to ~30 µm in diameter, and most
preferably ~500 nm to ~10 µm in diameter) formed from materials that are biodegradable and
non-toxic (e.g. a poly(α-hydroxy acid), a polyhydroxybutyric acid, a polyorthoester, a polyanhydride,
a polycaprolactone, etc.), with poly(lactide-co-glycolide), are preferred, optionally treated to have a
negatively-charged surface (e.g. with SDS) or a positively-charged surface (e.g. with a cationic
detergent, such as CTAB).

H. Liposomes

Examples of liposome formulations suitable for use as adjuvants are described in US Patent 
No. 6,090,406, US Patent No. 5,916,588, and EP 0 626 169. 

I. Polyoxyethylene Ether and Polyoxyethylene Ester Formulations

Adjuvants suitable for use in the invention include polyoxyethylene ethers and
polyoxyethylene esters (WO99/52549). Such formulations further include polyoxyethylene sorbitan
ester surfactants in combination with an octoxynol (WO01/21207) as well as polyoxyethylene alkyl
ethers or ester surfactants in combination with at least one additional non-ionic surfactant such as an
octoxynol (WO 01/21152).

Preferred polyoxyethylene ethers are selected from the following group: polyoxyethylene-9-
lauryl ether (laureth 9), polyoxyethylene-9-stearyl ether, polyoxyethylene-8-stearyl ether,
polyoxyethylene-4-lauryl ether, polyoxyethylene-35-lauryl ether, and polyoxyethylene-23-lauryl
ether.

J. Polyphosphazene (PCPP) 

PCPP formulations are described, for example, in Andrianov et al., "Preparation of hydrogel
microspheres by coacervation of aqueous polyphosphazene solutions", Biomaterials (1998)
19(1-3):109-115 and Payne et al., "Protein Release from Polyphosphazene Matrices", Adv. Drug. Delivery
Review (1998) 31(3):185-196.

K. Muramyl Peptides

Examples of muramyl peptides suitable for use as adjuvants in the invention include N-acetyl-
muramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-normuramyl-L-alanyl-D-isoglutamine (nor-
MDP), and N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1'-2'-dipalmitoyl-sn-glycero-3-
hydroxyphosphoryloxy)-ethylamine (MTP-PE).

L. Imidazoquinolone Compounds. 

Examples of imidazoquinolone compounds suitable for use as adjuvants in the invention include
Imiquimod and its homologues, described further in Stanley, "Imiquimod and the imidazoquinolones:
mechanism of action and therapeutic potential", Clin Exp Dermatol (2002) 27(7):571-577 and Jones,
"Resiquimod 3M", Curr Opin Investig Drugs (2003) 4(2):214-218.

The invention may also comprise combinations of aspects of one or more of the adjuvants 
identified above. For example, the following adjuvant compositions may be used in the invention: 

(1) a saponin and an oil-in-water emulsion (WO 99/11241);

(2) a saponin (e.g., QS21) + a non-toxic LPS derivative (e.g. 3dMPL) (see WO 94/00153);

(3) a saponin (e.g., QS21) + a non-toxic LPS derivative (e.g. 3dMPL) + a cholesterol;

(4) a saponin (e.g. QS21) + 3dMPL + IL-12 (optionally + a sterol) (WO 98/57659);

(5) combinations of 3dMPL with, for example, QS21 and/or oil-in-water emulsions (see
European patent applications 0835318, 0735898 and 0761231);

(6) SAF, containing 10% Squalane, 0.4% Tween 80, 5% pluronic-block polymer L121, and
thr-MDP, either microfluidized into a submicron emulsion or vortexed to generate a larger particle
size emulsion;

(7) Ribi™ adjuvant system (RAS) (Ribi Immunochem), containing 2% Squalene, 0.2% Tween
80, and one or more bacterial cell wall components from the group consisting of monophosphoryl
lipid A (MPL), trehalose dimycolate (TDM), and cell wall skeleton (CWS), preferably MPL + CWS
(Detox™);

(8) one or more mineral salts (such as an aluminum salt) + a non-toxic derivative of LPS (such as
3dMPL);

(9) one or more mineral salts (such as an aluminum salt) + an immunostimulatory oligonucleotide
(such as a nucleotide sequence including a CpG motif). Combination No. (9) is a preferred adjuvant
combination.

M. Human Immunomodulators 

Human immunomodulators suitable for use as adjuvants in the invention include cytokines,
such as interleukins (e.g. IL-1, IL-2, IL-4, IL-5, IL-6, IL-7, IL-12, etc.), interferons (e.g. interferon-γ),
macrophage colony stimulating factor, and tumor necrosis factor.

Aluminum salts and MF59 are preferred adjuvants for use with injectable influenza vaccines.
Bacterial toxins and bioadhesives are preferred adjuvants for use with mucosally-delivered vaccines,
such as nasal vaccines.






WO 2006/078318 PCT/US2005/027239 

The compositions of the present invention may be administered in combination
with an antibiotic treatment regime. In one embodiment, the antibiotic is administered prior to
administration of the antigen of the invention or the composition comprising the one or more of the
antigens of the invention.

In another embodiment, the antibiotic is administered subsequent to the administration of the
one or more antigens of the invention or the composition comprising the one or more antigens of the
invention. Examples of antibiotics suitable for use in the treatment of the Streptococcal infections of
the invention include but are not limited to penicillin or a derivative thereof or clindamycin or the
like.

Further antigens

The compositions of the invention may further comprise one or more additional Gram
positive bacterial antigens which are not associated with an AI. Preferably, the Gram positive
bacterial antigens that are not associated with an AI can provide protection across more than one
serotype or strain isolate. For example, a first non-AI antigen, in which the first non-AI antigen is at
least 90% (i.e., at least 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100%) homologous to the amino acid
sequence of a second non-AI antigen, wherein the first and the second non-AI antigen are derived
from the genomes of different serotypes of a Gram positive bacteria, may be further included in the
compositions. The first non-AI antigen may also be homologous to the amino acid sequence of a
third non-AI antigen, such that the first non-AI antigen, the second non-AI antigen, and the third
non-AI antigen are derived from the genomes of different serotypes of a Gram positive bacteria. The
first non-AI antigen may also be homologous to the amino acid sequence of a fourth non-AI antigen,
such that the first non-AI antigen, the second non-AI antigen, the third non-AI antigen, and the fourth
non-AI antigen are derived from the genomes of different serotypes of a Gram positive bacteria.

The first non-AI antigen may be GBS 322. The amino acid sequence identity of GBS 322 across GBS
strains from serotypes Ia, Ib, II, III, V, and VIII is greater than 90%. Alternatively, the first non-AI
antigen may be GBS 276. The amino acid sequence identity of GBS 276 across GBS strains from
serotypes Ia, Ib, II, III, V, and VIII is greater than 90%. Table 13 provides the percent amino acid
sequence identity of GBS 322 and GBS 276 across different GBS strains and serotypes.



Table 13: Conservation of GBS 322 and GBS 276 amino acid sequences

                         GBS 322                   GBS 276
Serotype  Strain         cGH    %AA identity       cGH    %AA identity
Ia        090            +      98.60                     97.90
          A909           +      98.30              +      97.90
          515            +      98.80              +      97.50
          DK1            +                         +
          DK8            +                         +
          Davis                                    +
Ib        7357b          +
          H36B                  98.30                     97.80
II        18RS21         +      100.00             +      99.90
          DK21           [...]
III       NEM316         +      100.00                    97.00
          COH31          +                         +
          D136           +                         +
          M732           +      98.00              +      100.00
          COH1           +      98.30              +      100.00
          M781           +      98.30                     99.60
No type   CJB110         +      98.60              +      97.90
          1169NT         +      97.40              +      97.90
V         CJB111         +      100.00             +
          2603           +      100.00             +      100.00
VIII      JM9130013             100.00                    97.90
          SMU014         +                         +
total                    22/22  98.28 +/- 0.4      22/22  98.44 +/- 1.094



As an example, inclusion of a non-AI protein, GBS 322, in combination with AI antigens 
GBS 67, GBS 80, and GBS 104 provided protection to newborn mice in an active maternal 
immunization assay. 

Table 14: Active maternal immunization assay for a combination of fragments from GBS 322, GBS 80, GBS 104, and GBS 67

                      FACS (Δ Mean)               MIX=322+80+104+67              PBS
GBS strains  Type     GBS 80   GBS 67   GBS 322   alive/treated  % protection    alive/treated  % protection
515          Ia       0        409      227       39/40          97              6/40           15
7357b-       Ib       91       316      102       19/30          63              1/30           3
DK21         II       0        331      416       25/34          73              17/48          35
5401         II       170      618      135       35/40          87              3/37           8
3050         II       43       460      188       48/48          100             1/30           3
COH1         III      305      0        130       36/36          100             7/40           17
M781         III      65       0        224       30/40          75              4/39           10
2603         V        125      105      313                      Q2              10/35          28
CJB111       V        370      481      63        25/28          89              4/46           9
JM9130013    VIII     597      83       143       37/39          95              5/40           12
JMU071       VIII     556      79       170       44/50          88              18/50          36
NTH 69       NT       0        443      213       12/32          37              11/35          31
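
The "% protection" columns of Table 14 appear to be the fraction of surviving pups (alive/treated)
expressed as a percentage. A minimal sketch of that arithmetic follows; the function name is
hypothetical, and the rounding to whole numbers used in the table is not reproduced because the
source does not state a rounding rule.

```python
def percent_protection(alive, treated):
    """Return survival as a percentage of treated animals (unrounded)."""
    if treated <= 0:
        raise ValueError("treated must be a positive count")
    if not 0 <= alive <= treated:
        raise ValueError("alive must lie between 0 and treated")
    return 100.0 * alive / treated


if __name__ == "__main__":
    # Counts taken from the 3050 (type II) row of Table 14.
    print(percent_protection(48, 48))  # immunized group
    print(percent_protection(1, 30))   # control group
```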



In fact, the non-AI GBS 322 antigen may itself provide protection to newborn mice in an
active maternal immunization assay.

Table 16: Active maternal immunization assay for each of GBS 80 and GBS 322 antigens

                      GBS 80                               GBS 322
                      FACS      Protection (% survival)    FACS      Protection (% survival)
GBS strains  Type     Δ Mean    antigen    ctrl-           Δ Mean    antigen    ctrl-
CJB111       V        370       72%        40%             63        57%        40%
COH1         III      305       76%        10%             130       3%         10%
2603         V        82        22%        34%             313       83%        34%
7357b-       Ib                 36%        34%             102       43%        34%
18RS21       II       0         15%        24%             268       84%        24%
DK21         II       0         10%        21%             416       67%        25%
A909         Ia       0         0%         14%
090          Ia       0         0%         0%
H36B         Ib                                            105       34%        32%

Thus, inclusion of a non-AI protein in an immunogenic composition of the invention may provide
increased protection to a mammal.

The immunogenic compositions comprising S. pneumoniae AI polypeptides may further comprise
secondary SP protein antigens which include (1) any of the SP protein antigens disclosed in WO
02/077021 or U.S. provisional application , filed April 20, 2005 (Attorney Docket
Number 00244 LOO 154), (2) immunogenic portions of the antigens comprising at least 7
contiguous amino acids, (3) proteins comprising amino acid sequences which retain
immunogenicity and which are at least 95% identical to these SP protein antigens (e.g., 95%,
96%, 97%, 98%, 99%, or 99.5% identical), and (4) fusion proteins, including hybrid SP protein
antigens, comprising (1)-(3).

Alternatively, the invention may include an immunogenic composition comprising a first and
a second Gram positive bacteria non-AI protein, wherein the polynucleotide sequence encoding the
sequence of the first non-AI protein is less than 90% (i.e., less than 90, 88, 86, 84, 82, 81, 78, 76, 74,
72, 70, 65, 60, 55, 50, 45, 40, 35, or 30 percent) homologous to the corresponding sequence in the
genome of the second non-AI protein.

The compositions of the invention may further comprise one or more additional non-Gram
positive bacterial antigens, including additional bacterial, viral or parasitic antigens. The
compositions of the invention may further comprise one or more additional non-GBS antigens,
including additional bacterial, viral or parasitic antigens.

In another embodiment, the GBS antigen combinations of the invention are combined with
one or more additional, non-GBS antigens suitable for use in a vaccine designed to protect elderly or
immunocompromised individuals. For example, the GBS antigen combinations may be combined with
an antigen derived from the group consisting of Enterococcus faecalis, Staphylococcus aureus,
Staphylococcus epidermidis, Pseudomonas aeruginosa, Legionella pneumophila, Listeria
monocytogenes, Neisseria meningitidis, influenza, and Parainfluenza virus ('PIV').

Where a saccharide or carbohydrate antigen is used, it is preferably conjugated to a carrier
protein in order to enhance immunogenicity {e.g. Ramsay et al. (2001) Lancet 357(9251):195-196;
Lindberg (1999) Vaccine 17 Suppl 2:S28-36; Buttery & Moxon (2000) J R Coll Physicians Lond
34:163-168; Ahmad & Chapnick (1999) Infect Dis Clin North Am 13:113-133, vii.; Goldblatt (1998)
J. Med. Microbiol 47:563-567; European patent 0 477 508; US Patent No. 5,306,492; International
patent application WO98/42721; Conjugate Vaccines (eds. Cruse et al.) ISBN 3805549326,
particularly vol. 10:48-114; and Hermanson (1996) Bioconjugate Techniques ISBN: 0123423368 or
012342335X}. Preferred carrier proteins are bacterial toxins or toxoids, such as diphtheria or tetanus
toxoids. The CRM197 diphtheria toxoid is particularly preferred {Research Disclosure, 453077 (Jan
2002)}. Other carrier polypeptides include the N. meningitidis outer membrane protein (EP-A-
0372501), synthetic peptides (EP-A-0378881; EP-A-0427347), heat shock proteins (WO 93/17712;
WO 94/03208), pertussis proteins (WO 98/58668; EP A 0471177), protein D from H. influenzae (WO
00/56360), cytokines (WO 91/01146), lymphokines, hormones, growth factors, toxin A or B from
C. difficile (WO00/61761), iron-uptake proteins (WO01/72337), etc. Where a mixture comprises
capsular saccharides from both serogroups A and C, it may be preferred that the ratio (w/w) of MenA
saccharide:MenC saccharide is greater than 1 (e.g. 2:1, 3:1, 4:1, 5:1, 10:1 or higher). Different
saccharides can be conjugated to the same or different type of carrier protein. Any suitable
conjugation reaction can be used, with any suitable linker where necessary.

Toxic protein antigens may be detoxified where necessary e.g. detoxification of pertussis 
toxin by chemical and/or genetic means. 

Where a diphtheria antigen is included in the composition it is preferred also to include 
tetanus antigen and pertussis antigens. Similarly, where a tetanus antigen is included it is preferred 
also to include diphtheria and pertussis antigens. Similarly, where a pertussis antigen is included it is 
preferred also to include diphtheria and tetanus antigens. 

Antigens in the composition will typically be present at a concentration of at least 1 µg/ml
each. In general, the concentration of any given antigen will be sufficient to elicit an immune 
response against that antigen. 

As an alternative to using protein antigens in the composition of the invention, nucleic acid
encoding the antigen may be used {e.g. refs. Robinson & Torres (1997) Seminars in Immunology
9:271-283; Donnelly et al. (1997) Annu Rev Immunol 15:617-648; Scott-Taylor & Dalgleish (2000)
Expert Opin Investig Drugs 9:471-480; Apostolopoulos & Plebanski (2000) Curr Opin Mol Ther
2:441-447; Ilan (1999) Curr Opin Mol Ther 1:116-120; Dubensky et al. (2000) Mol Med 6:723-732;
Robinson & Pertmer (2000) Adv Virus Res 55:1-74; Donnelly et al. (2000) Am J Respir Crit Care
Med 162(4 Pt 2):S190-193; and Davis (1999) Mt. Sinai J. Med. 66:84-90}. Protein components of the
compositions of the invention may thus be replaced by nucleic acid (preferably DNA e.g. in the form
of a plasmid) that encodes the protein.

The term "comprising" means "including" as well as "consisting" e.g. a composition
"comprising" X may consist exclusively of X or may include something additional e.g. X + Y.

The term "about" in relation to a numerical value x means, for example, x±10%.

A reference to a percentage sequence identity between two amino acid sequences means that,
when aligned, that percentage of amino acids are the same in comparing the two sequences. This
alignment and the percent homology or sequence identity can be determined using software programs
known in the art, for example those described in section 7.7.18 of Current Protocols in Molecular
Biology (F.M. Ausubel et al., eds., 1987) Supplement 30. A preferred alignment is determined by the
Smith-Waterman homology search algorithm using an affine gap search with a gap open penalty of 12
and a gap extension penalty of 2, and the BLOSUM62 matrix. The Smith-Waterman homology search
algorithm is disclosed in Smith & Waterman (1981) Adv. Appl. Math. 2:482-489.
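
To make the definition of percentage sequence identity concrete, the following minimal sketch
counts identical residues in two sequences that have already been aligned (for example by a
Smith-Waterman implementation, which is not reproduced here). The function name, the use of "-" as
the gap character, and the choice of the number of aligned columns as the denominator are
assumptions made for illustration; alignment tools differ on the denominator.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of alignment columns in which both sequences carry the same residue.

    Both inputs must be the same length (already aligned, gaps included).
    Columns where both sequences have a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no alignment columns to compare")
    return 100.0 * identical / compared


if __name__ == "__main__":
    # Hypothetical aligned fragments, one gap and one substitution in ten columns.
    print(percent_identity("MSRT-LRALL", "MSRTKLKALL"))
```

A threshold such as "at least 90% identical" can then be checked by comparing the returned value
with 90.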

The invention is further illustrated, without limitation, by the following examples.

EXAMPLE 1: Binding of an Adhesin Island surface protein, GBS 80, to Fibrinogen and
Fibronectin.

This example demonstrates that an Adhesin Island surface protein, GBS 80 can bind to 
fibrinogen and fibronectin. 

An enzyme-linked immunosorbent assay (ELISA) was used to analyse the in vitro ability of
recombinant GBS 80 to bind immobilized extra-cellular matrix (ECM) proteins but not bovine
serum albumin (BSA). Microtiter plates were coated with ECM proteins (fibrinogen, fibronectin,
laminin, collagen type IV) and binding was assessed by adding varying concentrations of a recombinant
form of GBS 80, over-expressed and purified from E. coli (FIGURE 5A). Plates were then incubated
sequentially with a) mouse anti-GBS 80 primary antibody; b) rabbit anti-mouse AP-conjugated
secondary antibody; c) pNPP colorimetric substrate. Relative binding was measured by monitoring
absorbance at 405 nm, using 595 nm as a reference wavelength. FIGURE 5B shows binding of
recombinant GBS 80 to immobilized ECM proteins (1 µg) as a function of concentration of GBS 80.
BSA was used as a negative control. Data points represent the means of OD405 values ± standard
deviation for 3 wells.
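
The read-out described above can be summarized per data point as a mean and standard deviation
over replicate wells. The sketch below assumes that the 595 nm reference reading is subtracted from
the 405 nm reading and that the sample (n-1) standard deviation is used; neither detail is stated
in the text, and the function names and readings are hypothetical.

```python
from statistics import mean, stdev


def corrected_od(a405, a595):
    """Absorbance at 405 nm with the 595 nm reference subtracted (assumed correction)."""
    return a405 - a595


def summarize_wells(readings):
    """Return (mean, sample standard deviation) of corrected OD over replicate wells.

    readings: list of (A405, A595) pairs, one pair per well; at least two wells.
    """
    values = [corrected_od(a405, a595) for a405, a595 in readings]
    return mean(values), stdev(values)


if __name__ == "__main__":
    # Hypothetical triplicate wells for one GBS 80 concentration.
    wells = [(0.82, 0.05), (0.79, 0.04), (0.85, 0.05)]
    avg, sd = summarize_wells(wells)
    print(round(avg, 3), round(sd, 3))
```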

Binding of GBS 80 to the tested ECM proteins was found to be concentration dependent and
exhibited saturation kinetics. As is also evident from FIGURE 5, binding of GBS 80 to fibronectin
and fibrinogen was greater than binding to laminin and collagen type IV at all the concentrations
tested.

EXAMPLE 2: GBS 80 is required for surface localization of GBS 104. 

This example demonstrates that co-expression of GBS 80 is required for surface localization
of GBS 104.

The polycistronic nature of the Adhesin Island I mRNA was investigated through reverse
transcriptase-PCR (RT-PCR) analysis employing primers designed to detect transcripts arising from
contiguous genes. Total RNA was isolated from GBS cultures grown to an optical density at 600 nm
(OD600) of 0.3 by the RNeasy Total RNA isolation method (Qiagen)
according to the manufacturer's instructions. The absence of contaminating chromosomal DNA was
confirmed by failure of the gene amplification reactions to generate a product detectable by agarose
gel electrophoresis in the absence of reverse transcriptase. RT-PCR analysis was performed with the
Access RT-PCR system (Promega) according to the manufacturer's instructions, employing PCR
cycling temperatures of 60°C for annealing and 70°C for extension. Amplification products were
visualized alongside 100-bp DNA markers in 2% agarose gels after ethidium bromide staining.

FIGURE 5 shows that all the genes are co-transcribed as an operon. A schematic of the AI-1
operon is shown above the agarose gel analysis of the RT-PCR products. Large rectangular arrows
indicate the predicted transcript direction. Primer pairs "1-4" were selected such that they cross the
3' finish-5' start of successive genes and overlap each gene by at least 200 bp. Additionally, "1"
crosses a putative rho-independent transcriptional terminator. "5" is an internal GBS 80 control and
"6" is an unrelated control from a highly expressed gene. Lanes: "a": RNA plus RTase enzyme; "b": RNA
without RTase; "c": genomic DNA control.

In the effort to elucidate the functions of the AI-1 proteins, in-frame deletions of all of the
genes within the operon have been constructed and the resulting mutants characterized with respect to
surface exposure of the encoded antigens (see FIGURE 8).

Each in-frame deletion mutation was constructed by splice overlap extension PCR (SOE-
PCR), essentially as described by Horton et al. [Horton R. M., Z. L. Cai, S. N. Ho, L. R. Pease (1990)
Biotechniques 8:528-35], using suitable primers, and cloned into the temperature sensitive shuttle
vector pJRS233 to replace the wild type copy by allelic exchange [Perez-Casal, J., J. A. Price, et al.
(1993) Mol Microbiol 8(5):809-19]. All plasmid constructions utilized standard molecular biology
techniques, and the identities of DNA fragments generated by PCR were verified by sequencing.
Following SOE-PCR, the resulting mutant DNA fragments were digested with XhoI and EcoRI, and
ligated into a similarly digested pJRS233. The resulting vectors were introduced by electroporation
into the chromosome of 2603 and COH1 GBS strains in a three-step process, essentially as described
in Framson et al. [Framson, P. E., A. Nittayajarn, J. Merry, P. Youngman, and C. E. Rubens. (1997)
Appl. Environ. Microbiol. 63(9):3539-47]. Briefly, the vector pJRS233 contains an erm gene
encoding erythromycin resistance and a temperature-sensitive gram-positive replicon that is active at
30°C but not at 37°C. Initially, the constructs are electroporated into GBS electro-competent cells
prepared as described by Framson et al., and transformants containing free plasmid are selected by
their ability to grow at 30°C on Todd-Hewitt Broth (THB) agar plates containing 1 µg/ml
erythromycin. The second step includes a selection step for strains in which the plasmid has integrated
into the chromosome via a single recombination event over the homologous plasmid insert and
chromosome sequence by their ability to grow at 37°C on THB agar medium containing 1 mg/ml
erythromycin. In the third step, GBS cells containing the plasmid integrated within the chromosome
(integrants) are serially passed in broth culture in the absence of antibiotics at 30°C. Plasmid excision
either completed the allelic exchange or reconstituted the wild-type genotype. Subsequent loss of
plasmid in the absence of antibiotic selection pressure resulted in an erythromycin-sensitive
phenotype. In order to assess gene replacement, erythromycin-sensitive colonies were screened
by analysis of the target gene PCR amplicons.

FIGURE 7 reports a schematic of the AI-1 operon for each knock-out strain generated, along
with the deletion position within the amino acid sequence. Most data presented here concern the
COH1 deletion strains, in which the expression of each of the antigens is higher by DNA microarray
analysis (data not shown) as well as detectable by FACS analysis (see FIGURE 8). The 2603 Δ80,
Δ104 double mutant was constructed by sequential allelic exchanges of the shown
alleles.

Immunization protocol 

Immune sera for FACS experiments were obtained as follows. 

Groups of 4 CD-1 outbred female mice 6-7 weeks old (Charles River Laboratories, Calco,
Italy) were immunized with the selected GBS antigens (20 µg of each recombinant GBS antigen)
suspended in 100 µl of PBS. Each group received 3 doses at days 0, 21 and 35. Immunization was
performed through intra-peritoneal injection of the protein with an equal volume of Complete
Freund's Adjuvant (CFA) for the first dose and Incomplete Freund's Adjuvant (IFA) for the following
two doses. In each immunization scheme negative and positive control groups are used. Immune
response was monitored by using serum samples taken on day 0 and 49.

FACS analysis 

Preparation of paraformaldehyde treated GBS cells and their FACS analysis were carried out 
as follows. 

GBS serotype COH1 strain cells were grown in Todd Hewitt Broth (THB; Difco
Laboratories, Detroit, Mich.) to OD600nm = 0.5. The culture was centrifuged for 20 minutes at 5000
rpm and bacteria were washed once with PBS, resuspended in PBS containing 0.05%
paraformaldehyde, and incubated for 1 hour at 37°C and then overnight at 4°C. 50 µl of fixed
bacteria (OD600 0.1) were washed once with PBS, resuspended in 20 µl of Newborn Calf Serum
(Sigma) and incubated for 20 min at room temperature. The cells were then incubated for 1 hour at
4°C in 100 µl of preimmune or immune sera, diluted 1:200 in dilution buffer (PBS, 20% Newborn
Calf Serum, 0.1% BSA). After centrifugation and washing with 200 µl of washing buffer (0.1% BSA
in PBS), samples were incubated for 1 hour at 4°C with 50 µl of R-Phycoerythrin conjugated F(ab)2
goat anti-mouse IgG (Jackson ImmunoResearch Laboratories, Inc.), diluted 1:100 in dilution buffer.
Cells were washed with 200 µl of washing buffer and resuspended in 200 µl of PBS. Samples were
analysed using a FACS Calibur apparatus (Becton Dickinson, Mountain View, Calif.) and data were
analyzed using the Cell Quest Software (Becton Dickinson). A shift in mean fluorescence intensity of
> 75 channels compared to preimmune sera from the same mice was considered positive. This cutoff
[...] against mock purified recombinant proteins from cultures of E. coli carrying the empty expression
vector and included in every experiment. Artifacts due to bacterial lysis were excluded using antisera
raised against 6 different known cytoplasmic proteins, all of which were negative.
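
The positivity criterion stated above can be written out as a short check. This is a minimal sketch, assuming mean fluorescence intensities are expressed in channels; the function names and the example readings are hypothetical and are not taken from the Cell Quest software.

```python
# Cutoff stated in the text: a shift of more than 75 channels is positive.
POSITIVE_SHIFT_CHANNELS = 75.0


def mean_shift(immune_mfi: float, preimmune_mfi: float) -> float:
    """Shift in mean fluorescence intensity (channels) relative to preimmune serum."""
    return immune_mfi - preimmune_mfi


def is_facs_positive(immune_mfi: float, preimmune_mfi: float,
                     cutoff: float = POSITIVE_SHIFT_CHANNELS) -> bool:
    """True when the shift strictly exceeds the cutoff."""
    return mean_shift(immune_mfi, preimmune_mfi) > cutoff


if __name__ == "__main__":
    # Hypothetical readings, for illustration only.
    print(is_facs_positive(immune_mfi=310.0, preimmune_mfi=120.0))
    print(is_facs_positive(immune_mfi=190.0, preimmune_mfi=120.0))
```

The comparison is strict, so a shift of exactly 75 channels is not scored as positive under this reading of "> 75 channels".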

FACS data on COH1 single KO mutants for GBS 104 and GBS 80 indicated that GBS 80 is
required for surface localization of GBS 104.

As shown in FIGURE 8, GBS 104 is not surface exposed in the Δ80 strain (second column,
bottom), but is present in the whole protein extracts (see FIGURE 10). Mean shift values suggest that
GBS 104 is partially responsible for GBS 80 surface exposure (mean shift of GBS 80 is reduced to
~60% of wild-type levels in Δ104), and that GBS 80 is over-expressed in the complemented strain (mean
shift value ~200% of wild-type level). The Δ80/pGBS 80 strain contains the GBS 80 orf cloned in the
shuttle-vector pAM401 (Wirth, R., F. Y. An, et al. (1986). J Bacteriol 165(3): 831-6). The vector
alone does not alter the secretion pattern of GBS 104 (right column). FACS was performed on mid-log
fixed bacteria with mouse polyclonal antibodies as indicated at left. Black peak is pre-immune
sera, colored peaks are sera from immunized animals.

EXAMPLE 3: Deletion of GBS 80 causes attenuation in vivo. 

This example demonstrates that deletion of GBS 80 causes attenuation in vivo, suggesting that 
this protein contributes to bacterial virulence. 

By using a mouse animal model, we studied the role of GBS 80 and GBS 104 in the virulence
of S. agalactiae.

Groups of ten outbred female mice 5-6 weeks old (Charles River Laboratories, Calco,
Italy) were inoculated intraperitoneally with different dilutions of the mutant strains, and LD50 (lethal
dose 50) values were calculated according to the method of Reed and Muench [Reed, L. J. and H. Muench
(1938). The American Journal of Hygiene 27(3): 493-7]. As presented in the table below, the LD50,
expressed in colony forming units (cfu), for both the Δ80 and the Δ80, Δ104 double mutants is about 10
fold higher than that of the wild type strain, suggesting that inactivation of GBS 80 but not GBS
104 is responsible for an attenuation in virulence. This finding indicates that the GBS 80 gene in AI-1
might contribute to virulence.
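
The Reed and Muench calculation cited above can be sketched as follows. This is an illustrative implementation under stated assumptions, not the applicants' own procedure: deaths are accumulated from the lowest dose upward, survivors from the highest dose downward, and the 50% endpoint is interpolated on a log10 dose scale between the two bracketing doses. The function name and the dose-response counts in the usage lines are hypothetical.

```python
import math
from typing import Sequence


def reed_muench_ld50(doses: Sequence[float], deaths: Sequence[int],
                     survivors: Sequence[int]) -> float:
    """Estimate the LD50 (same units as `doses`) by the Reed-Muench method."""
    order = sorted(range(len(doses)), key=lambda i: doses[i])
    d = [doses[i] for i in order]
    dead = [deaths[i] for i in order]
    alive = [survivors[i] for i in order]
    n = len(d)

    # An animal killed by a dose is assumed to die at every higher dose;
    # an animal surviving a dose is assumed to survive every lower dose.
    cum_dead = [sum(dead[: i + 1]) for i in range(n)]
    cum_alive = [sum(alive[i:]) for i in range(n)]

    mortality = []
    for cd, ca in zip(cum_dead, cum_alive):
        total = cd + ca
        mortality.append(100.0 * cd / total if total else 0.0)

    for i in range(n - 1):
        lo, hi = mortality[i], mortality[i + 1]
        if lo < 50.0 <= hi:
            # Proportionate distance between the two bracketing doses.
            pd = (50.0 - lo) / (hi - lo)
            step = math.log10(d[i + 1]) - math.log10(d[i])
            return 10.0 ** (math.log10(d[i]) + pd * step)
    raise ValueError("50% mortality is not bracketed by the tested doses")


if __name__ == "__main__":
    # Hypothetical data: ten mice per dose, ten-fold dilutions (cfu).
    doses = [1e7, 1e8, 1e9, 1e10]
    deaths = [0, 2, 8, 10]
    survivors = [10, 8, 2, 0]
    print(f"LD50 = {reed_muench_ld50(doses, deaths, survivors):.3g} cfu")
```

The estimate is only defined when the tested dilutions bracket 50% cumulative mortality, which is why the sketch raises an error otherwise.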

Table. Lethal dose 50% analysis of AI-1 mutants in the 2603 strain background. LD50s were
performed by IP injection of female CD1 mice at an age of 5-6 weeks. LD50s were calculated by the
method of Reed and Muench (8).

GBS strain                  LD50, cfu       Number of Experiments
Wild Type 2603              2 x 10^8
Δ104 mutant                 ~2 x 10^8
Δ80 mutant                  2.6 x 10^9      1
Δ80, Δ104 double mutant     ~2 x 10^9
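
The "about 10 fold" attenuation can be checked directly from the tabulated LD50 values. The short sketch below uses the four values from the table; the dictionary keys and the function name are illustrative labels, and the values marked approximate in the table are treated as exact for the arithmetic.

```python
# LD50 values (cfu) read from the table; approximate entries are used as given.
LD50_CFU = {
    "wild type 2603": 2e8,
    "delta-104 mutant": 2e8,
    "delta-80 mutant": 2.6e9,
    "delta-80, delta-104 double mutant": 2e9,
}


def fold_increase(strain: str, reference: str = "wild type 2603") -> float:
    """LD50 of `strain` divided by the LD50 of the reference strain."""
    return LD50_CFU[strain] / LD50_CFU[reference]


if __name__ == "__main__":
    for name in LD50_CFU:
        print(f"{name}: {fold_increase(name):.1f}-fold")
```

A higher LD50 means more bacteria are needed to kill half of the animals, so a ratio above 1 corresponds to attenuation.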



EXAMPLE 4: Effect of Adhesin Island Sortase Deletions on Surface Antigen Presentation

This example demonstrates the effect of adhesin island sortase deletions on surface antigen
presentation.

FACS analysis results set forth in FIGURE 9 show that a deletion in sortase SAG0648
prevented GBS 104 from reaching the surface and slightly reduced the surface exposure of GBS 80
(fourth panel; mean shift value ~60% of wild-type COH1). In the double sortase knock-out strain,
neither antigen was surface exposed (far right panel). Either sortase alone was sufficient for GBS 80
to arrive at the bacterial surface (third and fourth columns, top). No effect was seen on surface
exposure of antigens GBS 80 or GBS 104 in the ΔGBS 52 strain. Antibodies derived from purified
GBS 52 were either non-specific or were FACS negative for GBS 52 (data not shown). FACS
analysis was performed as described above (see EXAMPLE 2).

As shown in FIGURE 10, inactivation of GBS 80 has no effect on GBS 104 expression, just as
knock out of GBS 104 does not change the total amount of GBS 80 expressed. The Western blot of
whole protein extracts (strains noted above lanes) probed with anti-GBS 80 antisera is shown in panel
A. Arrow indicates expected size of GBS 80 (60 kDa). GBS 80 antibodies recognize a doublet; the
lower band is not present in ΔGBS 80 strains. Panel B shows a Western blot of whole protein extracts
probed with anti-GBS 104 antisera. Arrow indicates expected size of GBS 104 (99.4 kDa). Protein
extracts were prepared from the same bacterial cultures used for FACS (FIGURES 8 and 9). In
conclusion, although GBS 104 does not arrive at the surface in the Δ80 strain by FACS (FIGURE 8,
second column), it is present at approximately wild-type levels in the whole protein preps (B, second
lane). Approximately 20 µg of each protein extract was loaded per lane.

Western-blot analysis

Aliquots of total protein extract mixed with SDS loading buffer (1x: 60 mM TRIS-HCl pH
6.8, 5% w/v SDS, 10% v/v glycerin, 0.1% Bromophenol Blue, 100 mM DTT) and boiled 5 minutes at
95°C, were loaded on a 12.5% SDS-PAGE precast gel (Biorad). The gel is run using a SDS-PAGE
running buffer containing 250 mM TRIS, 2.5 mM Glycine and 0.1% SDS. The gel is electroblotted
onto nitrocellulose membrane at 200 mA for 60 minutes. The membrane is blocked for 60 minutes
with PBS/0.05% Tween-20 (Sigma), 10% skimmed milk powder and incubated O/N at 4°C with
PBS/0.05% Tween 20, 1% skimmed milk powder, with the appropriate dilution of the sera. After
washing twice with PBS/0.05% Tween, the membrane is incubated for 2 hours with peroxidase-conjugated
secondary anti-mouse antibody (Amersham) diluted 1:4000. The nitrocellulose is washed
three times for 10 minutes with PBS/0.05% Tween and once with PBS and thereafter developed by
Opti-4CN Substrate Kit (Biorad).

Example 5: Binding of Adhesin Island proteins to epithelial cells and effect of Adhesin Island 
proteins on capacity of GBS to adhere to epithelial cells. 

This example illustrates the binding of AI proteins to epithelial cells and the effect of AI
proteins on the capacity of GBS to adhere to epithelial cells.

Applicants analysed whether recombinant AI surface proteins GBS 80 or GBS 104 would
demonstrate binding to various epithelial cells in a FACS analysis. Applicants also analysed whether
deletion of AI surface proteins GBS 80 or GBS 104 would affect the capacity of GBS to adhere to and
invade ME180 cervical epithelial cells.

As shown in Figure 28, deletion of GBS 80 sequence from GBS strain isolate 2603 (serotype
V) did not affect the capacity of the mutated GBS to adhere to and invade ME180 cervical epithelial
cells. Here ME180 cervical carcinoma epithelial cells were infected with wild type GBS 2603 or
GBS 2603 Δ80 isogenic mutant. After two hours of infection, non-adherent bacteria were washed off
and infection prolonged for a further two hours and four hours. In invasion experiments, each
time point was followed by a two hour antibiotic treatment. Cells were then lysed with 1% saponin
and lysates plated on TSA plates. As shown in Figure 28, there was little difference between the
percent invasion or percent adhesion of wild type and mutant strains up to the four hour time point.
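
The text does not define how percent adhesion and percent invasion are computed. A common convention, assumed here purely for illustration, is to express the cfu recovered from the plated lysate as a percentage of the cfu in the inoculum. The function names, dilution factor, plated volume and counts below are all hypothetical.

```python
def cfu_per_ml(colonies: int, dilution_factor: float, plated_volume_ml: float) -> float:
    """Back-calculate cfu/ml of the undiluted lysate from a plate count."""
    if plated_volume_ml <= 0:
        raise ValueError("plated volume must be positive")
    return colonies * dilution_factor / plated_volume_ml


def percent_recovered(recovered_cfu: float, inoculum_cfu: float) -> float:
    """Recovered bacteria as a percentage of the inoculum (assumed definition)."""
    if inoculum_cfu <= 0:
        raise ValueError("inoculum must be positive")
    return 100.0 * recovered_cfu / inoculum_cfu


if __name__ == "__main__":
    # Hypothetical plate: 50 colonies from 0.1 ml of a 1:1000 dilution,
    # 1 ml of lysate in total, inoculum of 1e7 cfu.
    lysate_cfu = cfu_per_ml(colonies=50, dilution_factor=1000, plated_volume_ml=0.1) * 1.0
    print(f"{percent_recovered(lysate_cfu, 1e7):.1f}% of inoculum recovered")
```

Under this convention, lysates taken without antibiotic treatment give cell-associated (adherent plus internalized) bacteria, while lysates taken after antibiotic treatment give internalized bacteria only.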

Figure 30 repeats this experiment with both Δ104 and Δ80 mutants from a different strain
isolate. Here, ME180 cervical carcinoma epithelial cells were infected with GBS strain isolate COH1
(serotype III) wild type or COH1 ΔGBS 104 or COH1 Δ80 isogenic mutant. After one hour of
infection, non-adherent bacteria were washed off and the cells were lysed with 1% saponin. The
lysates were plated on TSA plates. As shown in Figure 30, while there was little difference in the
percent invasion, there was a significant decrease in the percent association of the Δ104 mutant
compared to both the wild type and Δ80 mutant.

The effect of AI surface proteins on the ability of GBS to translocate through an epithelial
monolayer was also analysed. As shown in Figure 31, a GBS 80 knockout mutant strain partially
loses the ability to translocate through an epithelial monolayer. Here epithelial monolayers were
inoculated with wildtype or knockout mutant in the apical chamber of a transwell system for two
hours and then non-adherent bacteria were washed off. Infection was prolonged for a further two and
four hours. Samples were taken from the media of the basolateral side and the number of colony
forming units measured. Transepithelial electrical resistance measured prior to and after infection
gave comparable values, indicating the maintenance of the integrity of the monolayer. By the six
hour time point, the Δ80 mutants demonstrated a reduced percent transcytosis.

A similar experiment was conducted with GBS 104 knock out mutants. Here, as shown in
Figure 22, the Δ104 mutants also demonstrated a reduced percent transcytosis, indicating that the
mutant strains translocate through an epithelial monolayer less efficiently than their isogenic wild type
counterparts.

Applicants also studied the effect of AI proteins on the capacity of a GBS strain to invade
J774 macrophage-like cells. Here, J774 cells were infected with GBS COH1 wild type or COH1
ΔGBS104 or COH1 ΔGBS80 isogenic mutants. After one hour of infection, non-adherent bacteria
were washed off and intracellular bacteria were recovered at two, four and six hours post antibiotic
treatment. At each time point, cells were lysed with 0.25% Triton X-100 and lysates plated on TSA
plates. As shown in Figure 32, the Δ104 mutant demonstrated a significantly reduced percent
invasion compared to both the wild type and Δ80 mutant.



Example 6: Hyperoligomeric structures comprising AI surface proteins GBS 80 and GBS 104.

This example illustrates hyperoligomeric structures comprising AI surface proteins GBS 80
and GBS 104. A GBS isolate COH1 (serotype III) was adapted to increase expression of GBS 80.
Figure 34 presents a regular negative stain electron micrograph of this mutant; no pilus or
hyperoligomeric structures are distinguishable on the surface of the bacteria. When the EM stain is
based on anti-GBS 80 antibodies labelled with 10 or 20 nm gold particles, the presence of GBS 80
throughout the hyperoligomeric structure is clearly indicated (Figures 36, 37 and 38). EM staining
against GBS 104 (anti-GBS 104 antibodies labelled with 10 nm gold particles) also reveals the
presence of GBS 104 primarily on or near the surface of the bacteria or potentially associated with
bacterial peptidoglycans (Figure 39). Analysis of this same strain (over-expressing GBS 80) with a
combination of both anti-GBS 80 (using 20 nm gold particles) and anti-GBS 104 (using 10 nm gold
particles) reveals the presence of GBS 104 on the surface and within the hyperoligomeric structures
(see Figures 40 and 41).

Example 7: GBS 80 is necessary for polymer formation and GBS 104 and sortase SAG0648 are
necessary for efficient pili assembly

This example demonstrates that GBS 80 is necessary for formation of polymers and that GBS
104 and sortase SAG0648 are necessary for efficient pili assembly. GBS 80 and GBS 104 polymeric
assembly was systematically analyzed in Coh1 strain single knock out mutants of each of the relevant
coding genes in AI-1 (GBS 80, GBS 104, GBS 52, sag0647, and sag0648). Figure 41 provides
Western blots of total protein extracts (strains noted above lanes) probed with either anti-GBS 80 (left
panel) sera or anti-GBS 104 sera (right panel) for each of these Coh1 and Coh1 knock out strains.
(Coh1, wild type Coh1; Δ80, Coh1 with GBS 80 knocked out; Δ104, Coh1 with GBS 104 knocked
out; Δ52, Coh1 with GBS 52 knocked out; Δ647, Coh1 with SAG0647 knocked out; Δ648, Coh1 with
SAG0648 knocked out; Δ647-8, Coh1 with SAG0647 and SAG0648 knocked out; Δ80/pGBS80,
Coh1 with GBS 80 knocked out but complemented with a high copy number plasmid expressing GBS
80. Asterisks identify the monomer of GBS 80 and GBS 104.)

The smear of immunoreactive material observed in the wild type strain, along with its
disappearance in Δ80 and Δ104 mutants, is consistent with the notion that such high molecular weight
structures are composed of covalently linked (SDS-resistant) GBS 80 and GBS 104 subunits. The
immunoblotting with both anti-GBS 80 (α-GBS 80) and anti-GBS 104 (α-GBS 104) revealed that
deletion of sortase SAG0648 also interferes with the assembly of high molecular weight species,
whereas the knock out mutant of the second sortase (SAG0647), even if somewhat reduced, still
maintains the ability to form polymeric structures.

Total extracts from GBS were prepared as follows. Bacteria were grown in 50 ml of Todd-Hewitt
broth (Difco) to an OD600nm of 0.5-0.6 and subsequently pelleted. After two washes in PBS the
pellet was resuspended and incubated 3 hours at 37°C with mutanolysin. Cells were then lysed with at
least three freezing-thawing cycles in dry ice and a 37°C bath. The lysate was then centrifuged to
eliminate the cellular debris and the supernatant was quantified. Approximately 40 µg of each protein
extract was separated on SDS-PAGE. The gel was then subjected to immunoblotting with mice
antisera and detected with chemiluminescence.

Example 8: GBS 80 is polymerized by an AI-2 sortase 

This example illustrates that GBS 80 can be polymerized not only by AI-1 sortases, but also
by AI-2 sortases. Figure 42 shows total cell extract immunoblots of GBS 515 strain, which lacks AI-1.
The left panel, where anti-GBS 67 sera was used, shows that GBS 67 from AI-2 is assembled
into high-molecular weight complexes, suggesting the formation of a second type of pilus. The same
high molecular weight structure is observed when GBS 80 is highly expressed by reintroducing the gene
within a plasmid (pGBS 80). By using anti-GBS 80 (right panel) sera on the same extracts, again it is
observed that, with GBS 80 over expression (515/pGBS 80), a high-molecular weight structure is
assembled. This implies that, in the absence of AI-1 sortases, AI-2 sortases (SAG1405 and SAG1406)
can complement the lacking function, still being able to assemble GBS 80 in a pilus structure.

Example 9: Coh1 produces a high molecular weight molecule, the GBS 80 pilin

This example illustrates that Coh1 produces a high molecular weight molecule, greater than
1000 kDa, which is the GBS 80 pilin. Figure 43 provides silver-stained electrophoretic gels that show
that Coh1 produces two macromolecules. One of these macromolecules disappears in the Coh1 GBS
80 knock out cells, but does not disappear in the Coh1 GBS 52 knock out mutant cells. The last two
lanes on the right were loaded with 15 times the amount loaded in the other lanes. This was done in
order to be able to count the bands. By doing this, a conservative size estimate of the top bands was
calculated by starting at 240 kDa and considering each of 14 higher bands as the result of consecutive
additions of a GBS 80 monomer.
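
The band-counting estimate can be reproduced with simple arithmetic. The sketch below assumes a monomer mass of 60 kDa, the expected size of GBS 80 quoted in Example 4; the text of this example does not state which monomer mass was used, so that value and the function name are assumptions.

```python
BASE_BAND_KDA = 240      # starting band size, per the text
HIGHER_BANDS = 14        # bands counted above the starting band, per the text
GBS80_MONOMER_KDA = 60   # assumed: expected size of GBS 80 quoted in Example 4


def estimate_top_band_kda(base_kda: int = BASE_BAND_KDA,
                          higher_bands: int = HIGHER_BANDS,
                          monomer_kda: int = GBS80_MONOMER_KDA) -> int:
    """Size of the top band if every higher band adds one monomer."""
    return base_kda + higher_bands * monomer_kda


if __name__ == "__main__":
    print(f"Estimated top band: {estimate_top_band_kda()} kDa")
```

With these inputs the estimate is 240 + 14 x 60 = 1080 kDa, which agrees with the statement that the molecule is greater than 1000 kDa.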

Coh1, wild type Coh1; Δ80, Coh1 cells with GBS 80 knocked out; Δ52, Coh1 cells with GBS
52 knocked out; Δ80/pGBS 80, Coh1 cells with GBS 80 knocked out and complemented with a high
copy number construct expressing GBS 80.

Example 10. GBS 52 is a minor component of the GBS pilus

This example illustrates that GBS 52 is present in the GBS pilus and is a minor component of
the pilus. Figure 45 shows an immunoblot of total cell extracts from a GBS Coh1 strain and a GBS
Coh1 strain knocked out for GBS 52 (Δ52). The total cell extracts were immunoblotted with anti-GBS 80
antisera (left) and anti-GBS 52 antisera (right). Immunoblotting was performed using a 3-8% Tris-acetate
polyacrylamide gel (Invitrogen) which provided excellent separation of large molecular
weight proteins (see Figure 41). When the gel was incubated with anti-GBS 80 sera, the bands from
the Coh1 wild-type strain appeared shifted when compared to the Δ52 mutant. This observation
indicated a different size of the pilus polymeric components in the two strains. When the same gel
was stripped and incubated with anti-GBS 52 sera, the high-molecular weight subunits in the Coh1 wild-type
strain showed a molecular size similar to that of those in the corresponding lane in the left panel. These
findings confirmed that GBS 52 is indeed associated with GBS 80 macro-molecular structures but
represents a minor component of the GBS pilus.

Example 11: Pilus structures are present in the supernatant of GBS bacterial cultures 

This example illustrates that the pilus structure assembled in Coh1 GBS is present in the
supernatant of a bacterial cell culture. Figure 46 shows an immunoblot of the protein extract of
the supernatant from cultures of different GBS mutant strains (117 = Coh1 GBS 80 knockout; 159 =
Coh1 GBS 104 knockout; 202 = Coh1 GBS 52 knockout; 206 = Coh1 GBS sag0647 knockout; 208 =
Coh1 GBS sag0648 knockout; 197 = Coh1 GBS sag0647/sag0648 knockout; 179 = Coh1 GBS 80
knockout complemented with a high copy plasmid expressing GBS 80). GBS 80 antisera detects the
presence of pilus structures in the appropriate Coh1 strains.

The protein extract was prepared as follows. Bacteria were grown in THB to an OD600nm of
0.5-0.6 and the supernatant was separated from the cells by centrifugation. The supernatant was then
filtered (Ø 0.2 µm) and 1 ml was mixed with 60% TCA for protein precipitation.

GBS pili were also extracted from the fraction of surface-exposed proteins in Coh1 strain and its GBS
80 knock out mutant as described hereafter. Bacteria were grown to an OD600nm of 0.6 in 50 ml of
THB at 37°C. Cells were washed once with PBS and the pellet was then resuspended in 0.1 M KPO4
pH 6.2, 40% sucrose, 10 mM MgCl2, 400 U/ml mutanolysin and incubated 3 hours at 37°C.
Protoplasts were separated by centrifugation and the supernatant was recovered and its protein content
measured.

In order to study the dynamics of pilus production during different growth phases, 1 ml of
supernatant of a culture at different OD600nm values was TCA precipitated and loaded onto a 3-8% SDS-PAGE
gel as described before. Figure 47 shows the corresponding Western blot with GBS 80 anti-sera.
The first group of lanes (left five sample lanes) refers to Coh1 strain growth (OD600nm values are noted
above the lanes) whereas the second group of lanes (right five samples) is from a GBS 80 knock out
strain over expressing GBS 80. The experiment shows that pilus macromolecular structures can be
found in the supernatant in all of the growth phases tested.

Example 12: In GBS strain Coh1, only GBS 80 and a sortase (sag0647 or sag0648) are required
for polymerization

This example describes requirements for pilus formation in Coh1. Figure 48 shows a Western
blot of total protein extracts (prepared as described before) using anti-GBS 80 sera on Coh1 clones.
(Coh1, wild type Coh1; Δ104, Coh1 knocked out for GBS 104; Δ647, Coh1 knocked out for sag0647;
Δ648, Coh1 knocked out for sag0648; Δ647-8, Coh1 knocked out for sag0647 and sag0648; 515, wild
type 515 strain, which lacks AI-1; p80, a high copy number plasmid which expresses GBS
80.) The data show that only the double sortase mutant is unable to polymerize GBS 80, indicating that
the 'conditio sine qua non' for pilus polymerization is the co-existence of GBS 80 with at least one
sortase. This result leads to a reasonable assumption that SAG1405 and SAG1406 are responsible for
polymerization in the 515 strain.
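
The requirement stated in this example (GBS 80 together with at least one sortase) can be expressed as a simple predicate over the tested genotypes. The sketch below is only a restatement of that conclusion, not a model of the blot data; the strain labels, the set representation and the function name are illustrative.

```python
from typing import Dict, FrozenSet, Tuple

# Genotypes described in this example: (GBS 80 present?, AI-1 sortases present).
STRAINS: Dict[str, Tuple[bool, FrozenSet[str]]] = {
    "Coh1 wild type": (True, frozenset({"SAG0647", "SAG0648"})),
    "d104": (True, frozenset({"SAG0647", "SAG0648"})),
    "d647": (True, frozenset({"SAG0648"})),
    "d648": (True, frozenset({"SAG0647"})),
    "d647-8": (True, frozenset()),
}


def can_polymerize_gbs80(has_gbs80: bool, sortases: FrozenSet[str]) -> bool:
    """GBS 80 polymerizes only when it co-exists with at least one sortase."""
    return has_gbs80 and len(sortases) > 0


if __name__ == "__main__":
    for name, (has_gbs80, sortases) in STRAINS.items():
        print(f"{name}: polymerization expected = {can_polymerize_gbs80(has_gbs80, sortases)}")
```

Only the double sortase knock out evaluates to False, matching the observation reported for Figure 48.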

Example 13: GBS 80 can be expressed in L. lactis under its own promoter and terminator 
sequences 

This example demonstrates that L. lactis, a non-pathogenic bacterium, can express GBS AI
polypeptides such as GBS 80. L. lactis M1363 (J. Bacteriol. 154 (1983): 1-9) was transformed with a
construct encoding GBS 80. Briefly, the construct was prepared by cloning a DNA fragment
containing the gene coding for GBS 80 under its own promoter and terminator sequences into plasmid
pAM401 (a shuttle vector for E. coli and Gram positive bacteria; J. Bacteriol. 165 (1986): 831-836).
Total extracts of the transformed bacteria in log phase were separated on SDS-PAGE,
transferred to membranes, and incubated with antiserum against GBS 80. A polypeptide
corresponding to the molecular weight of GBS 80 was detected in the lanes containing total extracts
of L. lactis transformed with the GBS 80 construct. See Figures 133A and 133B, lanes 6 and 7. This
same polypeptide was not detected in the lane containing total extracts of L. lactis not transformed
with the GBS 80 construct, lane 9. This example shows that L. lactis can express GBS 80 under its
own promoter and terminator.

Example 14: L. lactis modified to express GBS AI-1 under the GBS 80 promoter and 
terminator sequences expresses GBS 80 in polymeric structures 

This example demonstrates the ability of L. lactis to express GBS AI-1 polypeptides and to
incorporate at least some of the polypeptides into oligomers. L. lactis was transformed with a
construct containing the genes encoding GBS AI-1 polypeptides. Briefly, the construct was prepared
by cloning a DNA fragment containing the genes for GBS 80, GBS 52, SAG0647, SAG0648, and
GBS 104 under the GBS 80 promoter and terminator sequences into plasmid pAM401. The
construct was transformed into L. lactis M1363. Total extracts of log phase transformed bacteria were
separated on reducing SDS-PAGE, transferred to membranes, and incubated with antiserum against
GBS 80. A polypeptide with a molecular weight corresponding to the molecular weight of GBS 80
was detected in the lanes containing L. lactis transformed with the GBS AI-1 encoding construct. See
Figure 134, lane 2. In addition, the same lane also showed immunoreactivity of polypeptides having
higher molecular weights than the polypeptide having the molecular weight of GBS 80. These higher
molecular weight polypeptides are likely oligomers of GBS 80. Oligomers of similar molecular
weights were also detected in analysis of the culture supernatant of the transformed L. lactis.
See lane 4 of Figure 135. Thus, this example shows that L. lactis transformed to express GBS AI-1
can efficiently polymerize GBS 80 in the form of a pilus. This pilus structure can likely be purified
from either the cell culture supernatant or cell extracts.

Example 15: Cloning and Expression of S. pneumoniae Sp0462 

This example describes the production of a clone encoding a Sp0462 polypeptide and
expression of the clone. To produce a clone encoding Sp0462, the open reading frame encoding
Sp0462 was amplified using primers that annealed within the full-length Sp0462 open reading frame
sequence. Figure 150A provides an 893 amino acid sequence of Sp0462. The primers used to produce
a clone encoding the Sp0462 polypeptide are shown in Figure 150B. These primers annealed to the
nucleotide sequences encoding the amino acid residues indicated by underlining in Figure 150A.
Amplification of the open reading frame encoding Sp0462 using these primers produced the amplicon
shown at lane 2 of the agarose gel provided in Figure 160. The Sp0462 clone encodes amino acid
residues 38-862 of the 893 amino acid residue Sp0462 protein; the italicized residues in Figure 150A
were eliminated. Figure 151A provides a schematic depiction of the recombinant Sp0462
polypeptide. Figure 151B shows a schematic depiction of the full-length Sp0462 polypeptide. Both
the recombinant Sp0462 encoded by the clone and the full-length Sp0462 protein have two collagen
binding protein type B (Cna B) domains and a von Willebrand factor A (vWA) domain. The cloned
recombinant Sp0462 lacks the LPXTG motif present in the full-length Sp0462 protein. Western blot
analysis for expression of the Sp0462 clone did not result in detection of polypeptides with serum
obtained from S. pneumoniae-infected patients (Figure 152A) or GBS 80 antiserum (Figure 152B).

Example 16: Cloning and Expression of S. pneumoniae Sp0463 

This example describes the production of a clone encoding a Sp0463 polypeptide and
detection of recombinant Sp0463 polypeptide expressed from the clone. To produce a clone encoding
Sp0463, the open reading frame encoding Sp0463 was amplified using primers that annealed within
the full-length Sp0463 open reading frame sequence. Figure 153A provides a 665 amino acid
sequence of Sp0463. The primers used to produce the clone encoding Sp0463 polypeptide are shown
in Figure 153B. These primers annealed to the nucleotide sequences encoding the amino acid
residues indicated by underlining in Figure 153A. Amplification of the open reading frame encoding
Sp0463 using these primers produced the amplicon shown at lane 3 of the agarose gel provided in
Figure 160. The Sp0463 clone encodes amino acid residues 23-627 of the 665 amino acid residue
Sp0463 protein; the italicized residues in Figure 153A were eliminated. Figure 154A provides a
schematic depiction of the recombinant Sp0463 polypeptide. Figure 154B shows a schematic
depiction of the full-length Sp0463 polypeptide. Both the recombinant Sp0463 encoded by the clone
and the full-length Sp0463 protein have a Cna B domain and an E box motif. The cloned recombinant
Sp0463 lacks the LPXTG motif present in the full-length Sp0463 protein. Expression of the Sp0463
clone resulted in the detection of a 60 kD polypeptide, the expected molecular weight of the
recombinant Sp0463 polypeptide, by Western blot analysis. See Figure 155.

Example 17: Cloning and Expression of S. pneumoniae Sp0464 

This example describes the production of a clone encoding a Sp0464 polypeptide and 
detection of recombinant Sp0464 polypeptide expressed from the clone. To produce a clone encoding 
Sp0464, the open reading frame encoding Sp0464 was amplified using primers that annealed
within the full-length Sp0464 open reading frame sequence. Figure 157A provides a 393 amino acid
sequence of Sp0464. The primers used to produce a clone encoding the Sp0464 polypeptide are 
shown in Figure 157B. These primers annealed to the nucleotide sequences encoding the amino acid 
residues indicated by underlining in Figure 157A. Amplification of the open reading frame encoding 
Sp0464 using these primers produced the amplicon shown at lane 4 of the agarose gel provided in 
Figure 160. The Sp0464 clone encodes amino acid residues 19-356 of the 393 amino acid residue 
Sp0464 protein; the italicized residues in Figure 157A were eliminated. Figure 158A provides a 
schematic depiction of the recombinant Sp0464 polypeptide. Figure 158B shows a schematic 
depiction of the full-length Sp0464 polypeptide. Both the recombinant Sp0464 encoded by the clone 
and the full-length Sp0464 protein have two Cna B domains. The cloned recombinant Sp0464 lacks 
the LPXTG motif present in the full-length Sp0464 protein. Expression of the Sp0464 clone resulted 
in the detection of a 38 kD polypeptide, the expected molecular weight of the recombinant Sp0464 
polypeptide, by Western blot analysis. See Figure 159. 
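
The residue ranges given in Examples 15 to 17 determine the length of each recombinant fragment and how many residues were removed from each end. The sketch below computes those figures; the rough mass estimate uses an average of about 110 Da per residue, which is a rule-of-thumb assumption and not a value from the text, so it ignores amino acid composition and any expression tag.

```python
from typing import Dict, NamedTuple, Tuple


class Clone(NamedTuple):
    first_residue: int   # first residue encoded by the clone
    last_residue: int    # last residue encoded by the clone
    full_length: int     # length of the full-length protein


# Residue ranges as stated in Examples 15, 16 and 17.
CLONES: Dict[str, Clone] = {
    "Sp0462": Clone(38, 862, 893),
    "Sp0463": Clone(23, 627, 665),
    "Sp0464": Clone(19, 356, 393),
}

AVG_RESIDUE_KDA = 0.110  # assumption: approximate average mass per residue


def fragment_length(clone: Clone) -> int:
    """Number of residues encoded by the clone (inclusive range)."""
    return clone.last_residue - clone.first_residue + 1


def removed_residues(clone: Clone) -> Tuple[int, int]:
    """Residues eliminated from the N-terminus and the C-terminus."""
    return clone.first_residue - 1, clone.full_length - clone.last_residue


def approx_mass_kda(clone: Clone) -> float:
    """Very rough mass estimate from residue count alone."""
    return fragment_length(clone) * AVG_RESIDUE_KDA


if __name__ == "__main__":
    for name, clone in CLONES.items():
        n_term, c_term = removed_residues(clone)
        print(f"{name}: {fragment_length(clone)} residues, "
              f"{n_term} removed at N-terminus, {c_term} at C-terminus, "
              f"roughly {approx_mass_kda(clone):.0f} kDa")
```

The C-terminal truncation in each clone is consistent with the statement that the recombinant polypeptides lack the LPXTG motif, which lies near the C-terminus of the full-length proteins.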

Example 18: Intranasal Immunization of Mice with Recombinant L. lactis Expressing GBS 80 
and Subsequent Challenge 

This example describes a method of intranasally immunizing mice using L. lactis that express
GBS 80. Intranasal immunization consisted of 3 doses at days 0, 14 and 28, each dose administered
over three consecutive days. Each day, groups of 3 CD-1 outbred female mice 6-7 weeks old (Charles
River Laboratories, Calco, Italy) were immunized intranasally with 10^9 or 10^10 CFU of the
recombinant Lactococcus lactis suspended in 20 µl of PBS. In each immunization scheme negative
(wild-type L. lactis) and positive (recombinant GBS 80) control groups were used. The immune
response of the dams was monitored by using serum samples taken on day 0 and 49. The female mice
were bred 2-7 days after the last immunization (at approximately t = 36-37), and typically had a
gestation period of 21 days. Within 48 hours of birth, the pups were challenged via I.P. with GBS in a
dose approximately equal to an amount which would be sufficient to kill 90% of immunized pups (as
determined by empirical data gathered from PBS control groups). The GBS challenge dose is
preferably administered in 50 µl of THB medium. Preferably, the pup challenge takes place at 56 to
61 days after the first immunization. The challenge inocula were prepared starting from frozen
cultures diluted to the appropriate concentration with THB prior to use. Survival of pups was
monitored for 5 days after challenge.



Example 19: Subcutaneous Immunization of Mice with Recombinant L. lactis Expressing GBS 
80 and Subsequent Challenge 

This example describes a method of subcutaneously immunizing mice using L. lactis that
express GBS 80. Subcutaneous immunization consists of 3 doses at days 0, 14 and 28. Groups of 3
CD-1 outbred female mice 6-7 weeks old (Charles River Laboratories, Calco, Italy) were injected
subcutaneously with 10^9 or 10^10 CFU of the recombinant Lactococcus lactis suspended in 100 µl of
PBS. In each immunization scheme, negative (wild-type L. lactis) and positive (recombinant GBS 80)
control groups were used. The immune response of the dams was monitored by using serum samples
taken on day 0 and 49. The female mice were bred 2-7 days after the last immunization (at
approximately t = 36-37), and typically had a gestation period of 21 days. Within 48 hours of birth,
the pups were challenged via I.P. with GBS in a dose approximately equal to an amount which would
be sufficient to kill 90% of immunized pups (as determined by empirical data gathered from PBS
control groups). The GBS challenge dose is preferably administered in 50 µl of THB medium.
Preferably, the pup challenge takes place at 56 to 61 days after the first immunization. The challenge
inocula were prepared starting from frozen cultures diluted to the appropriate concentration with THB
prior to use. Survival of pups was monitored for 5 days after challenge.
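
The timing of the pup challenge follows from the breeding day, the gestation period and the 48 hour window after birth. The sketch below works from the breeding days stated in the text (approximately day 36 to 37) rather than deriving them from the dose schedule; the function name and the treatment of "within 48 hours" as at most two whole days are simplifying assumptions.

```python
from typing import Iterable, Tuple


def challenge_window(breeding_days: Iterable[int],
                     gestation_days: int = 21,
                     max_delay_after_birth_days: int = 2) -> Tuple[int, int]:
    """Earliest and latest challenge day, counted from the first immunization."""
    days = list(breeding_days)
    earliest = min(days) + gestation_days
    latest = max(days) + gestation_days + max_delay_after_birth_days
    return earliest, latest


if __name__ == "__main__":
    earliest, latest = challenge_window([36, 37])
    print(f"Pup challenge expected between day {earliest} and day {latest}")
    # Compare with the preferred window stated in the text (days 56 to 61).
    print("Within stated window:", 56 <= earliest and latest <= 61)
```

With breeding on day 36 or 37 and a 21 day gestation, births fall on days 57 to 58 and the challenge on days 57 to 60, which lies inside the preferred 56 to 61 day window.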

Example 20: Immunization of Mice with GAS AI polypeptides and Subsequent Intranasal 
Challenge 

This example describes a method of immunizing mice with GAS AI polypeptides and 
subsequently intranasally challenging the mice with GAS bacteria. Groups of 10 CD-1 female mice 
aged between 6 and 7 weeks are immunized with a combination of GAS antigens of the invention, 
GAS 15, GAS 16, and GAS 18 (15 µg of each recombinant antigen, derived from M1 strain SF370), 
or L. lactis expressing the M1 strain SF370 adhesin island, suspended in 100 µl of suitable solution. 
Each group receives 3 doses at days 0, 21 and 45. Immunization is performed through subcutaneous 
or intraperitoneal injection for the GAS 15, GAS 16, GAS 18 protein combination. The protein 
combination is administered with an equal volume of Complete Freund's Adjuvant (CFA) for the first 
dose and Incomplete Freund's Adjuvant (IFA) for the following two doses. Immunization is 
performed intranasally for the L. lactis expressing the M1 strain SF370 adhesin island. In each 
immunization scheme negative and positive control groups are used. 

The negative control group for the mice immunized with the GAS 15, GAS 16, GAS 18 
protein combination included mice immunized with PBS. The negative control group for the mice 
immunized with L. lactis expressing the M1 strain SF370 adhesin island included mice immunized 
with L. lactis transformed with the pAM401 expression vector lacking any 
cloned adhesin island sequence. 

The positive control groups included mice immunized with purified M1 strain SF370 M 
protein. 

Immunized mice are then anaesthetized with Zoletil and challenged intranasally with a 25 µL 
suspension containing 1.2 x 10^6 or 1.2 x 10^8 CFU of ISS 3348 in THB. Animals are observed daily 
and checked for survival. 

Example 21: Active Maternal Immunization Assay 

As used herein, an Active Maternal Immunization assay refers to an in vivo protection 
assay where female mice are immunized with the test antigen composition. The female mice are 
then bred and their pups are challenged with a lethal dose of GBS. Serum titers of the female 
mice during the immunization schedule are measured as well as the survival time of the pups 
after challenge. 

Mouse immunization 

Specifically, groups of 4 CD-1 outbred female mice 6-8 weeks old (Charles River 
Laboratories, Calco Italy) are immunized with one or more GBS antigens (20 µg of each 
recombinant GBS antigen), suspended in 100 µl of PBS. Each group receives 3 doses at days 0, 
21 and 35. Immunization is performed through intra-peritoneal injection of the protein with an 
equal volume of Complete Freund's Adjuvant (CFA) for the first dose and Incomplete Freund's 
Adjuvant (IFA) for the following two doses. In each immunization scheme negative and positive 
control groups are used. 

Immune response is monitored by using serum samples taken on day 0 and 49. The sera are 
analyzed as pools from each group of mice. 

Active maternal immunization 

A maternal immunization/neonatal pup challenge model of GBS infection was used to 
verify the protective efficacy of the antigens in mice. The mouse protection study was adapted 
from Rodewald et al. (Rodewald et al., J. Infect. Diseases 166, 635 (1992)). In brief, CD-1 female 
mice (6-8 weeks old) were immunized before breeding, as described above. The mice received 20 
µg of protein per dose when immunized with a single antigen and 60 µg of protein per dose (15 
µg of each antigen) when immunized with the combination of antigens. Mice were bred 2-7 days 
after the last immunization. Within 48 h of birth, pups were injected intraperitoneally with 50 µl 
of GBS culture. Challenge inocula were prepared starting from frozen cultures diluted to the 
appropriate concentration with THB before use. In preliminary experiments (not shown), the 
challenge doses per pup for each strain tested were determined to cause 90% lethality. Survival of 
pups was monitored for 2 days after challenge. Protection was calculated as (percentage 
deadControl minus percentage deadVaccine) divided by percentage deadControl multiplied by 
100. Data were evaluated for statistical significance by Fisher's exact test. 
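
The protection calculation and significance test described above can be sketched in a few lines of Python. This is an illustrative sketch only, not the analysis code used in the study: the function names, the group sizes in the usage note, and the choice of a two-sided test are assumptions made for the example, since the text does not state whether the Fisher's exact test was one-sided or two-sided.

```python
from math import comb


def protection_percent(dead_control, total_control, dead_vaccine, total_vaccine):
    """(% dead in control minus % dead in vaccine) / % dead in control x 100."""
    pct_control = 100.0 * dead_control / total_control
    pct_vaccine = 100.0 * dead_vaccine / total_vaccine
    if pct_control == 0:
        raise ValueError("no deaths in the control group; protection is undefined")
    return (pct_control - pct_vaccine) / pct_control * 100.0


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]].

    Rows are groups (e.g. vaccine, control); columns are outcomes (dead, alive).
    The p-value sums the hypergeometric probabilities of every table with the
    same margins that is no more likely than the observed table.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Probability of x deaths in the first row given fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo = max(0, col1 - row2)
    hi = min(row1, col1)
    # The small relative tolerance guards against floating-point ties.
    return sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-7))
```

For example, with hypothetical groups of 40 pups each, 36 deaths among control pups (90%) and 9 deaths among pups of immunized dams (22.5%), protection_percent(36, 40, 9, 40) gives 75% protection, and fisher_exact_two_sided(9, 31, 36, 4) gives the corresponding p-value. 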






WO 2006/078318 

Embodiments 



PCT/US2005/027239 



The invention encompasses, but is not limited to, the embodiments enumerated below. 



1. An immunogenic composition comprising a purified Group B Streptococcus (GBS) 
adhesin island (AI) polypeptide in oligomeric form. 

2. The immunogenic composition of embodiment 1 wherein the GBS AI polypeptide is 
selected from a GBS AI-1. 

3. The immunogenic composition of embodiment 1 wherein the GBS AI polypeptide is 
selected from a GBS AI-2. 

4. The immunogenic composition of any of embodiments 1-3 wherein the GBS AI 
polypeptide comprises a sortase substrate motif. 

5. The immunogenic composition of embodiment 4 wherein the sortase substrate motif is an 
LPXTG motif. 

6. The immunogenic composition of embodiment 5 wherein the LPXTG motif is represented 
by the amino acid sequence XPXTG, wherein the X at amino acid position 1 is an L, an I, or an F and 
the X at amino acid position 3 is any amino acid residue. 

7. The immunogenic composition of any one of embodiments 1-3 wherein the GBS AI 
polypeptide affects the ability of GBS bacteria to adhere to epithelial cells. 

8. The immunogenic composition of any one of embodiments 1-3 wherein the GBS AI 
polypeptide affects the ability of GBS bacteria to invade epithelial cells. 

9. The immunogenic composition of any one of embodiments 1-3 wherein the GBS AI 
polypeptide affects the ability of GBS bacteria to translocate through an epithelial cell layer. 

10. The immunogenic composition of any one of embodiments 1-3 wherein the GBS AI 
polypeptide is capable of associating with an epithelial cell surface. 

11. The immunogenic composition of embodiment 10 wherein the associating with an 
epithelial cell surface is binding to the epithelial cell surface. 

12. The immunogenic composition of any of embodiments 1-3 wherein the GBS AI 
polypeptide is a full-length GBS AI protein. 

13. The immunogenic composition of any of embodiments 1-3 wherein the GBS AI 
polypeptide is a fragment of a full-length GBS AI protein. 

14. The immunogenic composition of embodiment 13 wherein the fragment comprises at 
least 7 contiguous amino acid residues of the GBS AI protein. 

15. The immunogenic composition of embodiment 2 wherein the GBS AI polypeptide is 
selected from the group consisting of GBS 80, GBS 104, GBS 52, and fragments thereof. 

16. The immunogenic composition of embodiment 3 wherein the GBS AI polypeptide is 
selected from the group consisting of GBS 59, GBS 67, GBS 150, 01521, 01523, 01524, and 
fragments thereof. 

17. The immunogenic composition of embodiment 15 wherein the GBS AI polypeptide is 
GBS 80. 

18. The immunogenic composition of any of embodiments 1-3 or 15-17 wherein the 
oligomeric form is a hyperoligomer. 

19. The immunogenic composition of any of embodiments 1-3, or 15-17 further comprising a 
Gram positive bacterium antigen not associated with an AI. 

20. The immunogenic composition of embodiment 19 wherein the antigen is selected from 
the group consisting of GBS 322 and GBS 276. 

21. The immunogenic composition of embodiment 20 wherein the antigen is GBS 322. 

22. An immunogenic composition comprising a purified Gram positive bacteria adhesin 
island (AI) polypeptide in an oligomeric form. 

23. The immunogenic composition of embodiment 22 wherein the Gram positive bacteria is 
of a genus selected from the group consisting of Streptococcus, Enterococcus, Staphylococcus, or 
Listeria. 

24. The immunogenic composition of embodiment 23 wherein the Gram positive bacteria is 
of the genus Streptococcus. 

25. The immunogenic composition of any of embodiments 22-24 wherein the Gram positive 
bacteria AI polypeptide comprises a sortase substrate motif. 

26. The immunogenic composition of embodiment 25 wherein the sortase substrate motif is 
an LPXTG motif. 

27. The immunogenic composition of any one of embodiments 22-24 wherein the Gram 
positive bacteria AI polypeptide affects the ability of Gram positive bacteria to adhere to epithelial 
cells. 

28. The immunogenic composition of any one of embodiments 22-24 wherein the Gram 
positive bacteria AI polypeptide affects the ability of Gram positive bacteria to invade epithelial cells. 

29. The immunogenic composition of any one of embodiments 22-24 wherein the Gram 
positive bacteria AI polypeptide affects the ability of Gram positive bacteria to translocate through an 
epithelial cell layer. 

30. The immunogenic composition of any one of embodiments 22-24 wherein the Gram 
positive bacteria AI polypeptide is capable of associating with an epithelial cell surface. 

31. The immunogenic composition of embodiment 30 wherein the associating with an 
epithelial cell surface is binding to the epithelial cell surface. 

32. The immunogenic composition of any of embodiments 22-24 wherein the Gram positive 
bacteria AI polypeptide is a full-length Gram positive bacteria AI protein. 

33. The immunogenic composition of any of embodiments 22-24 wherein the Gram positive 
bacteria AI polypeptide is a fragment of a full-length Gram positive bacteria AI protein. 

34. The immunogenic composition of embodiment 33 wherein the fragment comprises at 
least 7 contiguous amino acid residues of the Gram positive bacteria AI protein. 

35. The immunogenic composition of embodiment 24 wherein the genus Streptococcus 
bacteria is Group A Streptococcus (GAS) bacteria and the Gram positive bacteria AI polypeptide is a 
GAS AI polypeptide. 

36. The immunogenic composition of embodiment 35 wherein the GAS AI polypeptide is 
selected from a GAS AI-1. 

37. The immunogenic composition of embodiment 35 wherein the GAS AI polypeptide is 
selected from a GAS AI-2. 

38. The immunogenic composition of embodiment 35 wherein the GAS AI polypeptide is 
selected from a GAS AI-3. 

39. The immunogenic composition of embodiment 35 wherein the GAS AI polypeptide is 
selected from a GAS AI-4. 

40. The immunogenic composition of any of embodiments 35-39 wherein the GAS AI 
polypeptide comprises a sortase substrate motif. 

41. The immunogenic composition of embodiment 40 wherein the sortase substrate motif is 
an LPXTG motif. 

42. The immunogenic composition of embodiment 41 wherein the LPXTG motif is 
represented by XXXXG, wherein the X at the first amino acid position is an L, a V, an E, or a Q, 
wherein the X at the second amino acid position is P if the X at the first amino acid position is an L, 
the X at the second amino acid position is a V if the X at the first amino acid position is an E or a Q, 
or the X at the second amino acid position is a V or a P if the X at the first amino acid position is a V, 
wherein the X at the third amino acid position is any amino acid residue, and wherein the X at the 
fourth amino acid position is a T if the X at the first amino acid position is a V, an E, or a Q, or the X 
at the fourth amino acid position is a T, an S, or an A if the X at the first amino acid position is an L. 

43. The immunogenic composition of any one of embodiments 35-39 wherein the GAS AI 
polypeptide affects the ability of GAS bacteria to adhere to epithelial cells. 

44. The immunogenic composition of any one of embodiments 35-39 wherein the GAS AI 
polypeptide affects the ability of GAS bacteria to invade epithelial cells. 

45. The immunogenic composition of any one of embodiments 35-39 wherein the GAS AI 
polypeptide affects the ability of GAS bacteria to translocate through an epithelial cell layer. 

46. The immunogenic composition of any one of embodiments 35-39 wherein the GAS AI 
polypeptide is capable of associating with an epithelial cell surface. 

47. The immunogenic composition of embodiment 46 wherein the associating with an 
epithelial cell surface is binding to the epithelial cell surface. 

48. The immunogenic composition of any of embodiments 35-39 wherein the GAS AI 
polypeptide is a full-length GAS AI protein. 

49. The immunogenic composition of any of embodiments 35-39 wherein the GAS AI 
polypeptide is a fragment of a full-length GAS AI protein. 

50. The immunogenic composition of embodiment 49 wherein the fragment comprises at 
least 7 contiguous amino acid residues of the GAS AI protein. 

51. The immunogenic composition of embodiment 36 wherein the GAS AI-1 polypeptide is 
selected from the group consisting of M6_Spy0157, M6_Spy0159, M6_Spy0160, CDC SS 
410_fimbrial, ISS3650_fimbrial, DSM2071_fimbrial, and fragments thereof. 

52. The immunogenic composition of embodiment 37 wherein the GAS AI-2 polypeptide is 
selected from the group consisting of GAS 15, GAS 16, GAS 18, and fragments thereof. 

53. The immunogenic composition of embodiment 38 wherein the GAS AI-3 polypeptide is 
selected from the group consisting of SpyM3_0098, SpyM3_0100, SpyM3_0102, SpyM3_0104, 
SPs0100, SPs0102, SPs0104, SPs0106, orf78, orf80, orf82, orf84, spyM18_0126, spyM18_0128, 
spyM18_0130, spyM18_0132, SpyoM01000156, SpyoM01000155, SpyoM01000154, 
SpyoM01000153, SpyoM01000152, SpyoM01000151, SpyoM01000150, SpyoM01000149, 
ISS3040_fimbrial, ISS3776_fimbrial, ISS4959_fimbrial, and fragments thereof. 

53. The immunogenic composition of embodiment 39 wherein the GAS AI-4 polypeptide is 
selected from the group consisting of 19224134, 19224135, 19224137, 19224139, 19224141, 
20010296_fimbrial, 20020069_fimbrial, CDC SS 635_fimbrial, ISS4883_fimbrial, ISS4538_fimbrial, 
and fragments thereof. 

54. The immunogenic composition of embodiment 24 wherein the Streptococcus bacteria is 
Streptococcus pneumoniae and the Gram positive bacteria AI polypeptide is a S. pneumoniae AI 
polypeptide. 

55. The immunogenic composition of embodiment 54 wherein the S. pneumoniae AI 
polypeptide comprises a sortase substrate motif. 

56. The immunogenic composition of embodiment 55 wherein the sortase substrate motif is 
an LPXTG motif. 

57. The immunogenic composition of embodiment 54 wherein the S. pneumoniae AI 
polypeptide affects the ability of S. pneumoniae to adhere to epithelial cells. 

58. The immunogenic composition of embodiment 54 wherein the S. pneumoniae AI 
polypeptide affects the ability of S. pneumoniae to invade epithelial cells. 

59. The immunogenic composition of embodiment 54 wherein the S. pneumoniae AI 
polypeptide affects the ability of S. pneumoniae to translocate through an epithelial cell layer. 

60. The immunogenic composition of embodiment 54 wherein the S. pneumoniae AI 
polypeptide is capable of associating with an epithelial cell surface. 

61. The immunogenic composition of embodiment 60 wherein the associating with an 
epithelial cell surface is binding to the epithelial cell surface. 

62. The immunogenic composition of embodiment 54 wherein the S. pneumoniae AI 
polypeptide is a full-length S. pneumoniae AI protein. 

63. The immunogenic composition of embodiment 54 wherein the S. pneumoniae AI 
polypeptide is a fragment of a full-length S. pneumoniae AI protein. 

64. The immunogenic composition of embodiment 63 wherein the fragment comprises at 
least 7 contiguous amino acid residues of the S. pneumoniae AI protein. 

65. The immunogenic composition of embodiment 54 wherein the S. pneumoniae AI 
polypeptide is selected from the group consisting of SP0462, SP0463, SP0464, orf3_670, orf4_670, 
orf5_670, ORF3_14CSR, ORF4_14CSR, ORF5_14CSR, ORF3_19AH, ORF4_19AH, ORF5_19AH, 
ORF3_19FTW, ORF4_19FTW, ORF5_19FTW, ORF3_23FP, ORF4_23FP, ORF5_23FP, 
ORF3_23FTW, ORF4_23FTW, ORF5_23FTW, ORF3_6BF, ORF4_6BF, ORF5_6BF, ORF3_6BSP, 
ORF4_6BSP, ORF5_6BSP, ORF3_9VSP, ORF4_9VSP, ORF5_9VSP, and fragments thereof. 

66. The immunogenic composition of any one of embodiments 22-24, 35-39, 51-54, or 65 
wherein the oligomeric form is a hyperoligomer. 

67. The immunogenic composition of any one of embodiments 22-24, 35-39, 51-54, or 65 
further comprising a Gram positive bacteria antigen not associated with an AI. 

68. The immunogenic composition of embodiment 67 wherein the antigen is selected from 
the group consisting of GBS 322 and GBS 276. 

69. An immunogenic composition comprising a first and a second Group B Streptococcus 
(GBS) adhesin island (AI) polypeptide. 

70. The immunogenic composition of embodiment 69 wherein a full-length polynucleotide 
sequence encoding for the first GBS AI polypeptide is not present in a GBS bacteria genome 
comprising a polynucleotide sequence encoding for the second GBS AI polypeptide. 

71. The immunogenic composition of embodiment 69 wherein polynucleotides encoding the 
first and the second GBS AI polypeptide are each present in genomes of more than one GBS serotype 
and strain isolate. 

72. The immunogenic composition of embodiment 69 wherein the first GBS AI polypeptide 
is encoded by a GBS AI-1. 

73. The immunogenic composition of embodiment 69 wherein the first GBS AI polypeptide 
is encoded by a GBS AI-2. 

74. The immunogenic composition of embodiment 72 wherein the second GBS AI 
polypeptide is encoded by a GBS AI-2. 

75. The immunogenic composition of embodiment 73 wherein the second GBS AI 
polypeptide is encoded by a GBS AI-2. 

76. The immunogenic composition of embodiment 72 wherein the second GBS AI 
polypeptide is encoded by a GBS AI-1. 

77. The immunogenic composition of embodiment 73 wherein the second GBS AI 
polypeptide is encoded by a GBS AI-1. 

78. The immunogenic composition of embodiment 72 wherein the first GBS AI polypeptide 
is selected from the group consisting of GBS 80, GBS 104, GBS 52, and fragments thereof. 

79. The immunogenic composition of embodiment 73 wherein the first GBS AI polypeptide 
is selected from the group consisting of GBS 59, GBS 67, GBS 150, 01521, 01523, 01524, and 
fragments thereof. 

80. The immunogenic composition of embodiment 74 or 75 wherein the second GBS AI 
polypeptide is selected from the group consisting of GBS 59, GBS 67, GBS 150, 01521, 01523, 
01524, and fragments thereof, and wherein the first and the second GBS AI polypeptide are not the 
same polypeptide. 

81. The immunogenic composition of embodiment 76 or 77 wherein the second GBS AI 
polypeptide is selected from the group consisting of GBS 80, GBS 104, GBS 52, and fragments 
thereof, and wherein the first and the second GBS AI polypeptide are not the same polypeptide. 

82. The immunogenic composition of any one of embodiments 69-77 wherein the first GBS 
AI polypeptide comprises a sortase substrate motif. 

83. The immunogenic composition of embodiment 82 wherein the sortase substrate motif is 
an LPXTG motif. 

84. The immunogenic composition of embodiment 83 wherein the LPXTG motif is 
represented by the sequence XPXTG, wherein the X at amino acid position 1 is an L, an I, or an F and 
the X at amino acid position 3 is any amino acid residue. 

85. The immunogenic composition of any one of embodiments 69-77 wherein the first GBS 
AI polypeptide affects the ability of GBS bacteria to adhere to epithelial cells. 

86. The immunogenic composition of any one of embodiments 69-77 wherein the first GBS 
AI polypeptide affects the ability of GBS bacteria to invade epithelial cells. 

87. The immunogenic composition of any one of embodiments 69-77 wherein the first GBS 
AI polypeptide affects the ability of GBS bacteria to translocate through an epithelial cell layer. 

88. The immunogenic composition of any one of embodiments 69-77 wherein the first GBS 
AI polypeptide is capable of associating with an epithelial cell surface. 

89. The immunogenic composition of embodiment 88 wherein the associating with an 
epithelial cell surface is binding to the epithelial cell surface. 

90. The immunogenic composition of any of embodiments 69-77 wherein the first GBS AI 
polypeptide is a full-length GBS AI protein. 

91. The immunogenic composition of any of embodiments 69-77 wherein the first GBS AI 
polypeptide is a fragment of a full-length GBS AI protein. 

92. The immunogenic composition of embodiment 91 wherein the fragment comprises at 
least 7 contiguous amino acid residues of the first GBS AI protein. 

93. The immunogenic composition of any one of embodiments 69-79 wherein the first GBS 
AI polypeptide is in oligomeric form. 

94. The immunogenic composition of any one of embodiments 69-77 wherein the second 
GBS AI polypeptide is in oligomeric form. 

95. The immunogenic composition of any one of embodiments 69-79 wherein the first and 
the second GBS AI polypeptide are associated in a single oligomeric form. 

96. The immunogenic composition of embodiment 95 wherein the first and the second GBS 
AI polypeptides are chemically associated. 

97. The immunogenic composition of embodiment 95 wherein the first and the second GBS 
AI polypeptides are physically associated. 

98. The immunogenic composition of embodiment 93 wherein the oligomeric form is a 
hyperoligomer. 

99. The immunogenic composition of embodiment 94 wherein the oligomeric form is a 
hyperoligomer. 

100. The immunogenic composition of embodiment 76 wherein the first GBS AI polypeptide 
is GBS 80 and the second GBS AI polypeptide is GBS 104. 

101. The immunogenic composition of embodiment 74 wherein the first GBS AI polypeptide 
is GBS 80 and the second GBS AI polypeptide is GBS 67. 

102. The immunogenic composition of any one of embodiments 69-79, 100, or 101 further 
comprising a GBS polypeptide not associated with an AI. 

103. The immunogenic composition of embodiment 102 wherein the GBS polypeptide not 
associated with an AI is selected from the group consisting of GBS 322 and GBS 276. 

104. The immunogenic composition of embodiment 103 wherein the GBS polypeptide not 
associated with an AI is GBS 322. 

105. An immunogenic composition comprising a first and a second Gram positive bacteria 
adhesin island (AI) polypeptide. 

106. The immunogenic composition of embodiment 105 wherein a full length polynucleotide 
sequence encoding for the first Gram positive bacteria AI polypeptide is not present in a genome of a 
Gram positive bacteria comprising a full length polynucleotide sequence encoding for the second 
Gram positive bacteria AI polypeptide. 

107. The immunogenic composition of embodiment 105 wherein polynucleotides encoding 
the first and the second Gram positive bacteria AI polypeptide are each present in genomes of more 
than one Gram positive bacteria serotype and strain isolate. 

108. The immunogenic composition of embodiment 105 wherein the first and the second 
Gram positive bacteria AI polypeptides are of different Gram positive bacteria species. 

109. The immunogenic composition of embodiment 105 wherein the first and the second 
Gram positive bacteria AI polypeptides are of the same Gram positive bacteria species. 

110. The immunogenic composition of embodiment 105 wherein the first and the second 
Gram positive bacteria AI polypeptides are from different AI subtypes. 

111. The immunogenic composition of embodiment 105 wherein the first and the second 
Gram positive bacteria AI polypeptides are from the same AI subtype. 

112. The immunogenic composition of embodiment 105 wherein the first Gram positive 
bacteria AI polypeptide has detectable surface exposure on a first Gram positive bacteria strain or 
serotype but not a second Gram positive bacteria strain or subtype and the second Gram positive 
bacteria AI polypeptide has detectable surface exposure on the second Gram positive bacteria strain or 
serotype but not the first Gram positive bacteria strain or serotype. 

113. The immunogenic composition of embodiment 105 wherein the Gram positive bacteria 
is S. pneumoniae, S. mutans, E. faecalis, E. faecium, C. difficile, L. monocytogenes, or C. diphtheriae. 

114. The immunogenic composition of any of embodiments 105-113 wherein the first and 
the second Gram positive bacteria AI polypeptides comprise a sortase substrate motif. 

115. The immunogenic composition of embodiment 114 wherein the sortase substrate motif 
is an LPXTG motif. 

116. The immunogenic composition of embodiment 115 wherein the LPXTG motif is 
represented by XXXXG, wherein the X at amino acid position 1 is an L, a V, an E, an I, an F, or a Q, 
wherein X at amino acid position 2 is a P if X at amino acid position 1 is an L, an I, or an F, wherein 
X at amino acid position 2 is a V if X at amino acid position 1 is a E or a Q, wherein X at amino acid 
position 2 is a V or a P if X at amino acid position 1 is a V, wherein X at amino acid position 3 is any 
amino acid residue, wherein X at amino acid position 4 is a T if X at amino acid position 1 is a V, E, 
I, F, or Q, and wherein X at amino acid position 4 is a T, S, or A if X at amino acid position 1 is an L. 

117. The immunogenic composition of embodiment 105 wherein the first Gram positive 
bacteria AI polypeptide is a first Group A Streptococcus (GAS) AI polypeptide. 

118. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide comprises a sortase substrate motif. 

119. The immunogenic composition of embodiment 118 wherein the sortase substrate motif 
is an LPXTG motif. 

120. The immunogenic composition of embodiment 119 wherein the LPXTG motif is 
represented by XXXXG, wherein the X at the first amino acid position is an L, a V, an E, or a Q, 
wherein the X at the second amino acid position is P if the X at the first amino acid position is an L, 
the X at the second amino acid position is a V if the X at the first amino acid position is an E or a Q, 
or the X at the second amino acid position is a V or a P if the X at the first amino acid position is a V, 
wherein the X at the third amino acid position is any amino acid residue, and wherein the X at the 
fourth amino acid position is a T if the X at the first amino acid position is a V, an E, or a Q, or the X 
at the fourth amino acid position is a T, an S, or an A if the X at the first amino acid position is an L. 

121. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide affects the ability of GAS bacteria to adhere to epithelial cells. 

122. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide affects the ability of GAS bacteria to invade epithelial cells. 

123. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide affects the ability of GAS bacteria to translocate through an epithelial cell layer. 

124. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide is capable of associating with an epithelial cell surface. 

125. The immunogenic composition of embodiment 117 wherein the associating with an 
epithelial cell surface is binding to the epithelial cell surface. 

126. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide is a full-length GAS AI protein. 

127. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide is a fragment of a full-length GAS AI protein. 

128. The immunogenic composition of embodiment 127 wherein the fragment comprises at 
least 7 contiguous amino acid residues of the GAS AI protein. 

129. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide is a first GAS AI-1 polypeptide. 

130. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide is a first GAS AI-2 polypeptide. 

131. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide is a first GAS AI-3 polypeptide. 

132. The immunogenic composition of embodiment 117 wherein the first GAS AI 
polypeptide is a first GAS AI-4 polypeptide. 

133. The immunogenic composition of any one of embodiments 117 or 129-132 wherein the 
second Gram positive bacteria AI polypeptide is a second GAS AI polypeptide. 

134. The immunogenic composition of embodiment 133 wherein the second GAS AI 
polypeptide is a second GAS AI-1 polypeptide. 

135. The immunogenic composition of embodiment 133 wherein the second GAS AI 
polypeptide is a second GAS AI-2 polypeptide. 

136. The immunogenic composition of embodiment 133 wherein the second GAS AI 
polypeptide is a second GAS AI-3 polypeptide. 

137. The immunogenic composition of embodiment 133 wherein the second GAS AI 
polypeptide is a second GAS AI-4 polypeptide. 

138. The immunogenic composition of embodiment 129 wherein the first GAS AI-1 
polypeptide is selected from the group consisting of M6_Spy0157, M6_Spy0159, M6_Spy0160, CDC 
SS 410_fimbrial, ISS3650_fimbrial, DSM2071_fimbrial, and fragments thereof. 

139. The immunogenic composition of embodiment 130 wherein the first GAS AI-2 
polypeptide is selected from the group consisting of GAS 15, GAS 16, GAS 18, and fragments thereof. 

140. The immunogenic composition of embodiment 131 wherein the first GAS AI-3 
polypeptide is selected from the group consisting of SpyM3_0098, SpyM3_0100, SpyM3_0102, 
SpyM3_0104, SPs0100, SPs0102, SPs0104, SPs0106, orf78, orf80, orf82, orf84, spyM18_0126, 
spyM18_0128, spyM18_0130, spyM18_0132, SpyoM01000156, SpyoM01000155, SpyoM01000154, 
SpyoM01000153, SpyoM01000152, SpyoM01000151, SpyoM01000150, SpyoM01000149, 
ISS3040_fimbrial, ISS3776_fimbrial, ISS4959_fimbrial, and fragments thereof. 

141. The immunogenic composition of embodiment 132 wherein the first GAS AI-4 
polypeptide is selected from the group consisting of 19224134, 19224135, 19224137, 19224139, 
19224141, 20010296_fimbrial, 20020069_fimbrial, CDC SS 635_fimbrial, ISS4883_fimbrial, 
ISS4538_fimbrial, and fragments thereof. 

142. The immunogenic composition of embodiment 134 wherein the second GAS AI-1 
polypeptide is selected from the group consisting of M6_Spy0157, M6_Spy0159, M6_Spy0160, CDC 
SS 410_fimbrial, ISS3650_fimbrial, DSM2071_fimbrial, and fragments thereof. 

143. The immunogenic composition of embodiment 135 wherein the second GAS AI-2 
polypeptide is selected from the group consisting of GAS 15, GAS 16, GAS 18, and fragments thereof. 

144. The immunogenic composition of embodiment 136 wherein the second GAS AI-3 
polypeptide is selected from the group consisting of SpyM3_0098, SpyM3_0100, SpyM3_0102, 
SpyM3_0104, SPs0100, SPs0102, SPs0104, SPs0106, orf78, orf80, orf82, orf84, spyM18_0126, 
spyM18_0128, spyM18_0130, spyM18_0132, SpyoM01000156, SpyoM01000155, SpyoM01000154, 
SpyoM01000153, SpyoM01000152, SpyoM01000151, SpyoM01000150, SpyoM01000149, 
ISS3040_fimbrial, ISS3776_fimbrial, ISS4959_fimbrial, and fragments thereof. 

145. The immunogenic composition of embodiment 137 wherein the second GAS AI-4 
polypeptide is selected from the group consisting of 19224134, 19224135, 19224137, 19224139, 
19224141, 20010296_fimbrial, 20020069_fimbrial, CDC SS 635_fimbrial, ISS4883_fimbrial, 
ISS4538_fimbrial, and fragments thereof. 

146. The immunogenic composition of any one of embodiments 117-132 or 138-141 wherein 
the second Gram positive bacteria AI polypeptide is a Group B Streptococcus (GBS) AI polypeptide. 

147. The immunogenic composition of embodiment 146 wherein the GBS AI polypeptide 
comprises a sortase substrate motif. 

148. The immunogenic composition of embodiment 147 wherein the sortase substrate motif
is an LPXTG motif.

149. The immunogenic composition of embodiment 148 wherein the LPXTG motif is 
represented by the amino acid sequence XPXTG, wherein the X at amino acid position 1 is an L, an I, 
or an F and the X at amino acid position 3 is any amino acid residue. 

150. The immunogenic composition of embodiment 146 wherein the GBS AI polypeptide 
affects the ability of GBS bacteria to adhere to epithelial cells. 

151. The immunogenic composition of embodiment 146 wherein the GBS AI polypeptide
affects the ability of GBS bacteria to invade epithelial cells.

152. The immunogenic composition of embodiment 146 wherein the GBS AI polypeptide
affects the ability of GBS bacteria to translocate through an epithelial cell layer.

153. The immunogenic composition of embodiment 146 wherein the GBS AI polypeptide is
capable of associating with an epithelial cell surface.

154. The immunogenic composition of embodiment 146 wherein the associating with an
epithelial cell surface is binding to the epithelial cell surface.

155. The immunogenic composition of embodiment 146 wherein the GBS AI polypeptide is
a full-length GBS AI protein.

156. The immunogenic composition of embodiment 146 wherein the GBS AI polypeptide is
a fragment of a full-length GBS AI protein.

157. The immunogenic composition of embodiment 156 wherein the fragment comprises at 
least 7 contiguous amino acid residues of the GBS AI protein. 

158. The immunogenic composition of embodiment 146 wherein the GBS AI polypeptide is 
a GBS AI-1 polypeptide. 

159. The immunogenic composition of embodiment 146 wherein the GBS AI polypeptide is
a GBS AI-2 polypeptide.

160. The immunogenic composition of embodiment 158 wherein the GBS AI-1 polypeptide 
is selected from the group consisting of GBS 80, GBS 104, GBS 52, and fragments thereof. 

161. The immunogenic composition of embodiment 159 wherein the GBS AI-2 polypeptide
is selected from the group consisting of GBS 59, GBS 67, GBS 150, 01521, 01523, 01524, and
fragments thereof.

162. The immunogenic composition of any one of embodiments 117-132 or 138-141 wherein
the second Gram positive bacteria AI polypeptide is a Streptococcus pneumoniae AI polypeptide.

163. The immunogenic composition of embodiment 162 wherein the S. pneumoniae AI
polypeptide comprises a sortase substrate motif.

164. The immunogenic composition of embodiment 163 wherein the sortase substrate motif 
is an LPXTG motif. 

165. The immunogenic composition of embodiment 162 wherein the S. pneumoniae AI 
polypeptide affects the ability of S. pneumoniae to adhere to epithelial cells. 

166. The immunogenic composition of embodiment 162 wherein the S. pneumoniae AI polypeptide
affects the ability of S. pneumoniae to invade epithelial cells.

167. The immunogenic composition of embodiment 162 wherein the S. pneumoniae AI 
polypeptide affects the ability of S. pneumoniae to translocate through an epithelial cell layer. 

168. The immunogenic composition of embodiment 162 wherein the S. pneumoniae AI
polypeptide is capable of associating with an epithelial cell surface.

169. The immunogenic composition of embodiment 168 wherein the associating with an
epithelial cell surface is binding to the epithelial cell surface.

170. The immunogenic composition of embodiment 162 wherein the S. pneumoniae AI
polypeptide is a full-length S. pneumoniae AI protein.

171. The immunogenic composition of embodiment 162 wherein the S. pneumoniae AI
polypeptide is a fragment of a full-length S. pneumoniae AI protein.

172. The immunogenic composition of embodiment 162 wherein the fragment comprises at
least 7 contiguous amino acid residues of the S. pneumoniae AI protein.

173. The immunogenic composition of embodiment 162 wherein the S. pneumoniae AI polypeptide
is selected from the group consisting of SP0462, SP0463, SP0464, orf3_670, orf4_670, orf5_670,
ORF3_14CSR, ORF4_14CSR, ORF5_14CSR, ORF3_19AH, ORF4_19AH, ORF5_19AH,
ORF3_19FTW, ORF4_19FTW, ORF5_19FTW, ORF3_23FP, ORF4_23FP, ORF5_23FP,
ORF3_23FTW, ORF4_23FTW, ORF5_23FTW, ORF3_6BF, ORF4_6BF, ORF5_6BF, ORF3_6BSP,
ORF4_6BSP, ORF5_6BSP, ORF3_9VSP, ORF4_9VSP, ORF5_9VSP, and fragments thereof.

174. The immunogenic composition of any one of embodiments 105-117 wherein the first 
Gram positive bacteria AI polypeptide is in oligomeric form. 

175. The immunogenic composition of embodiment 174 wherein the oligomeric form is a
hyperoligomer.

176. The immunogenic composition of embodiment 174 wherein the second Gram positive 
bacteria AI polypeptide is in oligomeric form. 

177. The immunogenic composition of embodiment 176 wherein the oligomeric form is a
hyperoligomer.

178. The immunogenic composition of embodiment 176 wherein the first and the second 
Gram positive bacteria AI polypeptide are associated in a single oligomeric form. 

179. The immunogenic composition of embodiment 178 wherein the first and the second 
Gram positive bacteria AI polypeptide are chemically associated. 

180. The immunogenic composition of embodiment 178 wherein the first and the second
Gram positive bacteria AI polypeptide are physically associated.

181. The immunogenic composition of any one of embodiments 105-117 further comprising
a Gram positive bacteria polypeptide not associated with an AI.

182. The immunogenic composition of embodiment 181 wherein the Gram positive bacteria
polypeptide not associated with an AI is selected from the group consisting of GBS 322 and GBS 276.

183. The immunogenic composition of embodiment 182 wherein the Gram positive bacteria 
polypeptide not associated with an AI is GBS 322. 

184. A modified Gram positive bacterium adapted to produce increased levels of AI surface
protein.

185. The modified Gram positive bacterium of embodiment 184 wherein the AI surface
protein is in oligomeric form.

186. The modified Gram positive bacterium of embodiment 185 wherein the oligomeric form
is a hyperoligomer.

187. The modified Gram positive bacterium of any one of embodiments 184-186 which is a
Group B Streptococcus bacterium.

188. The modified Gram positive bacterium of any one of embodiments 184-186 which is a
Group A Streptococcus bacterium.

189. The modified Gram positive bacterium of any one of embodiments 184-186 which is a
non-pathogenic Gram positive bacterium.

190. The modified Gram positive bacterium of embodiment 189 wherein the non-pathogenic
Gram positive bacterium is Streptococcus gordonii.

191. The modified Gram positive bacterium of embodiment 189 wherein the non-pathogenic
Gram positive bacterium is Lactococcus lactis.

192. The modified Gram positive bacterium of any one of embodiments 184-186 which has 
been inactivated and wherein the AI surface protein is exposed on the surface of the Gram positive 
bacterium. 

193. The modified Gram positive bacterium of any one of embodiments 184-186 which has
been attenuated and wherein the AI surface protein is exposed on the surface of the Gram positive
bacterium.

194. The modified GBS bacterium of embodiment 187 which has been inactivated and 
wherein the AI surface protein is exposed on the surface of the GBS bacterium. 

195. The modified GBS bacterium of embodiment 187 which has been attenuated and
wherein the AI surface protein is exposed on the surface of the GBS bacterium.

196. The modified GAS bacterium of embodiment 188 which has been inactivated and 
wherein the AI surface protein is exposed on the surface of the GAS bacterium. 

197. The modified GAS bacterium of embodiment 188 which has been attenuated and 
wherein the AI surface protein is exposed on the surface of the GAS bacterium. 

198. The modified non-pathogenic bacterium of embodiment 189 which has been inactivated
and wherein the AI surface protein is exposed on the surface of the non-pathogenic Gram positive
bacterium.

199. The modified non-pathogenic bacterium of embodiment 189 which has been attenuated
and wherein the AI surface protein is exposed on the surface of the non-pathogenic Gram positive
bacterium.

200. A method for manufacturing an oligomeric adhesin island (AI) surface antigen
comprising:

culturing a Gram positive bacterium that expresses an oligomeric AI surface antigen and
isolating the expressed oligomeric AI surface antigen.

201. The method of embodiment 200 wherein the step of isolating is performed by collecting
said oligomeric AI surface antigen from Gram positive bacterium secretions in the Gram positive
bacterium culture.

202. The method of embodiment 200 further comprising a step of purifying.

204. The method of embodiment 200 wherein the Gram positive bacterium is adapted for
increased AI protein expression. 

205. The method of any one of embodiments 200-204 wherein the Gram positive bacterium 
is a Group A Streptococcus bacterium. 

206. The method of any one of embodiments 200-204 wherein the Gram positive bacterium 
is a Group B Streptococcus bacterium. 

207. The method of any one of embodiments 200-204 wherein the oligomeric AI surface
antigen is in hyperoligomeric form.

208. The method of embodiment 200 wherein the Gram positive bacterium expresses the
oligomeric AI surface antigen recombinantly.

209. The method of embodiment 208 wherein the Gram positive bacterium is further
manipulated to express at least 1 AI sortase.

210. The modified Gram positive bacterium of any one of embodiments 184-186 which is a 
S. pneumoniae bacterium. 

211. The method of any one of embodiments 200-204 wherein the Gram positive bacterium 
is S. pneumoniae. 
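
As an illustration of the sortase substrate motif recited in embodiments 148-149 (XPXTG, where position 1 is L, I, or F and position 3 is any residue) and of the fragment length recited in embodiments 157 and 172 (at least 7 contiguous residues), the following Python sketch scans a polypeptide sequence for the motif and tests whether a candidate shares a contiguous run with a reference protein. The function names, the restriction to the 20 standard one-letter residue codes, and the toy sequence are assumptions made for illustration only and do not appear in the specification.

```python
import re

# Assumption: sequences use the 20 standard one-letter amino acid codes.
STANDARD_RESIDUES = "ACDEFGHIKLMNPQRSTVWY"

# XPXTG as described in embodiment 149: position 1 is L, I, or F;
# position 3 is any (standard) residue.
SORTASE_MOTIF = re.compile(rf"[LIF]P[{STANDARD_RESIDUES}]TG")


def find_sortase_motifs(sequence: str) -> list[tuple[int, str]]:
    """Return (1-based start position, matched motif) for each XPXTG hit."""
    seq = sequence.upper()
    return [(m.start() + 1, m.group(0)) for m in SORTASE_MOTIF.finditer(seq)]


def comprises_contiguous_run(candidate: str, protein: str, min_len: int = 7) -> bool:
    """True if candidate contains at least min_len contiguous residues of protein."""
    cand, prot = candidate.upper(), protein.upper()
    # Slide a window of min_len over the candidate and look it up in the protein.
    return any(
        cand[i:i + min_len] in prot
        for i in range(len(cand) - min_len + 1)
    )


if __name__ == "__main__":
    toy_protein = "MKLPATGAAIPQTGEE"  # hypothetical sequence, not from the specification
    print(find_sortase_motifs(toy_protein))
    print(comprises_contiguous_run("AAIPQTGXX", toy_protein))
```

A plain regular expression is sufficient here because two XPXTG occurrences cannot overlap: the proline required at position 2 of any later hit would have to coincide with the T or G of the earlier one.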
WE CLAIM:

1. An immunogenic composition comprising a purified Group B Streptococcus (GBS) 
adhesin island (AI) polypeptide in oligomeric form. 

2. The immunogenic composition of claim 1 wherein the GBS AI polypeptide is selected
from a GBS AI-1.

3. The immunogenic composition of claim 1 wherein the GBS AI polypeptide is selected 
from a GBS AI-2. 

4. The immunogenic composition of claim 2 wherein the GBS AI polypeptide is selected 
from the group consisting of GBS 80, GBS 104, GBS 52, and fragments thereof. 

5. The immunogenic composition of claim 3 wherein the GBS AI polypeptide is selected
from the group consisting of GBS 59, GBS 67, GBS 150, 01521, 01523, 01524, and fragments
thereof.

6. The immunogenic composition of claim 4 wherein the GBS AI polypeptide is GBS 80. 

7. The immunogenic composition of any of claims 1-6 wherein the oligomeric form is a
hyperoligomer.

8 (22). An immunogenic composition comprising a purified Gram positive bacteria adhesin 
island (AI) polypeptide in an oligomeric form. 

9 (23). The immunogenic composition of claim 8 wherein the Gram positive bacteria is of a
genus selected from the group consisting of Streptococcus, Enterococcus, Staphylococcus,
Clostridium, Corynebacterium, or Listeria.

10 (24). The immunogenic composition of claim 9 wherein the Gram positive bacteria is of
the genus Streptococcus.

11 (35). The immunogenic composition of claim 10 wherein the genus Streptococcus
bacteria is Group A Streptococcus (GAS) bacteria and the Gram positive bacteria AI polypeptide is a
GAS AI polypeptide.

12 (36). The immunogenic composition of claim 11 wherein the GAS AI polypeptide is 
selected from a GAS AI-1. 

13 (37). The immunogenic composition of claim 11 wherein the GAS AI polypeptide is 
selected from a GAS AI-2. 

14 (38). The immunogenic composition of claim 11 wherein the GAS AI polypeptide is
selected from a GAS AI-3.

15 (39). The immunogenic composition of claim 11 wherein the GAS AI polypeptide is 
selected from a GAS AI-4. 

16 (66). The immunogenic composition of any one of claims 8-15 wherein the oligomeric
form is a hyperoligomer.

17. An immunogenic composition comprising a first and a second Group B Streptococcus
(GBS) adhesin island (AI) polypeptide.

18. The immunogenic composition of claim 17 wherein the first GBS AI polypeptide is
encoded by a GBS AI-1.

19. The immunogenic composition of claim 18 wherein the second GBS AI polypeptide is 
encoded by a GBS AI-2. 

20. The immunogenic composition of claim 18 wherein the first GBS AI polypeptide is
selected from the group consisting of GBS 80, GBS 104, GBS 52, and fragments thereof.

21. The immunogenic composition of claim 19 wherein the second GBS AI polypeptide is
selected from the group consisting of GBS 59, GBS 67, GBS 150, 01521, 01523, 01524, and
fragments thereof, and wherein the first and the second GBS AI polypeptide are not the same
polypeptide.

22. The immunogenic composition of claim 19 wherein the first GBS AI polypeptide is GBS 
80 and the second GBS AI polypeptide is GBS 67. 

23. An immunogenic composition comprising a first and a second Gram positive bacteria 
adhesin island (AI) polypeptide. 

24. The immunogenic composition of claim 23 wherein the Gram positive bacteria is
Streptococcus, Enterococcus, Staphylococcus, Clostridium, Corynebacterium, or Listeria.

25. The immunogenic composition of claim 23 wherein the first Gram positive bacteria AI 
polypeptide is a first Group A Streptococcus (GAS) AI polypeptide. 

26. The immunogenic composition of claim 25 wherein the first GAS AI polypeptide is a
first GAS AI-1 polypeptide.

27. The immunogenic composition of claim 25 wherein the first GAS AI polypeptide is a 
first GAS AI-2 polypeptide. 

28. The immunogenic composition of claim 25 wherein the first GAS AI polypeptide is a 
first GAS AI-3 polypeptide. 

29. The immunogenic composition of claim 25 wherein the first GAS AI polypeptide is a
first GAS AI-4 polypeptide.

30. The immunogenic composition of any one of claims 25-29 wherein the second Gram 
positive bacteria AI polypeptide is a second GAS AI polypeptide. 

31. The immunogenic composition of claim 30 wherein the second GAS AI polypeptide is a
second GAS AI-1 polypeptide.

32. The immunogenic composition of claim 30 wherein the second GAS AI polypeptide is a 
second GAS AI-2 polypeptide. 

33. The immunogenic composition of claim 30 wherein the second GAS AI polypeptide is a 
second GAS AI-3 polypeptide. 

34. The immunogenic composition of claim 30 wherein the second GAS AI polypeptide is a
second GAS AI-4 polypeptide.

35. A modified Gram positive bacterium adapted to produce increased levels of AI surface
protein.




WO 2006/078318 



PCT/US2005/027239 




36. The modified Gram positive bacterium of claim 35 wherein the AI surface protein is in
oligomeric form.

37. The modified Gram positive bacterium of claim 36 wherein the oligomeric form is a
hyperoligomer.

38. The modified Gram positive bacterium of any one of claims 35-37 which is a
non-pathogenic Gram positive bacterium.

39. The modified Gram positive bacterium of claim 38 wherein the non-pathogenic Gram
positive bacterium is Lactococcus lactis.

40. A method for manufacturing an oligomeric adhesin island (AI) surface antigen
comprising:

culturing a Gram positive bacterium that expresses an oligomeric AI surface antigen and
isolating the expressed oligomeric AI surface antigen.

TAACACCCATTGCAAATAACATCAATCAAATtDcGCGTGCTATCAACCAA a909_aTl.seq 

ACCCATCTAATTTCTCA CC CTCAC T ATCACTTTCATCCAGAACCACCTTA Majority 
™ ? SO 90 IS) 

KCCATCtAATTTrTrifr^Tp* * *^CiCTTTCATCCAGAACCACGTTA cjblll alltseq 

ACCCA^C^AA???r^r^^I^ CTATCACTTTCATCCAGAACCACC n«316~all .se^ 

ACCCATCTAATTTCTCAGCCTCAGTATCACTTTCATCCACAAGCACCTTA a909_al 1 .seq 

AGCCCAACAGCACCGGGGGTTGCTCTGATYC GGTTA'AGTAA'TAAGCAa'aa Majority 
'?° »30 ,40 ~~Z> 

AGCCCAACAGCA Grrrrr pi^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^A^A^^? llt!^ 
A"ccCC^CACCACCrrr£rH^ cjbl7l_ail?seq 
ACCCCAAC AG C^CCC^t r^l G "CTGA.TTCGGTTAACTAATAAGCAAAA „ta3I61al 1 .s^ 
AGCCCAACAGCAGGCGGGGTTGCTCTGATTCGGTTAAGTAATAACCAAAA a909_a7l.. seq 

TCCTCAGGAATTTT CCCA CTAATAAGAGGCCTTGTTGACGT. ATACGCATT Hajorlty 

■ - 1 ? 0 l7 .° ■ IfO 19Q ~l£o 

^^^^^^^^^^^^^^^^^^^tT^G^I^^^^^r^^ 2603_ali .seq 

TC^CACcK^??rrr A ^? AA J AAGAGGCCTTGTTGAGCTA T A G «'»" cjblU.ailtL, 
J C C A C C A A T ? ? ? r r r 4 r I * f £ f ■* C A ° G C G T T C T T G A G C T A T A G C G A TT neBSfeZall.s^ 
TCCTCACCAATTTTCCCACTAATAAGAGGCGTTGTTGAG CIA T AG G C A T T a909_ail. s eq 

ACCTCTCCCATACTeCATCATCCTCACAAAA TCACCATTCCCCATAAAAG Majority 
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^^G^c^? A ^r^ A jr A ^^ cjtiu^n^ 

ACCTCTG CCATA r t r r L GTCACAMATCACCATTC( ? CCATA ' AAAfi nem316_ail.se£ 

ACCTCTGCCATA GTCCATGATGGTCAC AAA ATGACCATTGCCCATAAAAG a909_aTl.seq 

CGCCAACCAAAACTTCACCTGTG C C C G T T T TTTCA TGTC A AC ACC AGTAT Majority 
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CCCCAACCAAAAfTTrirriS G .H' G c G T T T TTTCA T G T C A A C A C C AG T AT 2603_aH .seq 
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CATGTA'CCCC TT A A r irrrrrT^^' ^ A ^ G ^ G e * * ^ * ** A CAT A A T A 26^!, .seq 
G AT C TA C r r-^ T T 4 ^ r 4 t r S ^ o CCTATCTCCA'ACACAtAIAAIA 18rs21 i all^eq 

C A T G I AC tc C T T A A r AT r^rrTlrr'' 1 ' A T C T 9 c A A C . A G' A G A T AA TA cohl_all .seq^ 
GAT'CTA^r'r'rTX A ' A n A T^^ ^ G ^ G ^ AA ^ AGAGA ^**^* cJbUl_aU Jeq 

GATcJAdCCC^A ACA ? GCCC ^^ G ^ 

GATGTACC C C T T A A C.A T G C CGTTTGCTATCTCCAACAGAGAT A A T A' a909_aU.seq 



# /?/«7 - 

^PCT/US200 5 /027239 pagaa 

T C A C C A j A /C fc g ffl fjg jggjjS Mg A A C C G T A ATCTTGCTAT C ^TTtI ^^; ! 

. 2? J™ 380 390 ^0 

150 TGACCAAACGAACAATCACCGATGACTTTAACC. GTAATCTTGCTATrrrr ^ih.TT . 

s! TCACCAAAGCAArAAT^ A ^^^ A ^^ A ^'*^'^'^ AA ^' nefl#l6~~all~seq 

151 TCACCAAAGCAAGAATCACCGATGACTTTAACCGTAATCTTCCTATCGCC a909_au"eq 

TTTTTCATTA G-C C G C T A A T A'T TTGATTAGC A ATCACGGTCCGACCAAGAfi Majority 
: f£ . 4 ?° j30 440 ~~Zo 

T T T T t C A^ ? A G C C G r T A A t t t 2 S 03_aU. seq 
^"lGA^ A rr^^ AA ? A ^^! TTACCAATCA " CTCCCACCAA,: *« : I8rs2I_aU^eq 
iliII5 AGCCCCTAATATTTGATTAGCAA TCACGGTCCCACCAAGAG cnhl all <^ 

TTTTTCAT- TAGCCCCTAATATTTGATTAGCAATCACGGTGCGACCAAGAG a^a it 

CAACTGTTGAAGACGACA'AGGTATGATGT T TTTCCTGAGCCAATTTA A CI Majority 

12°_ 490 500 

CllcttTICAACACririijfTiTri 2603^1,^ 
MAVHTTCAAGACCACAAGGTATGATGTTTT T.C CTGACCCAATTTAACA utrrin ,.11™ 

C A A C T G T T G A A C A C C A C A A G C T A T C A T G T T TT T C C T G A C C d A A T T T A A C A «nf aU w" 

C'AA'C IS t TB A AC A f f 1 r'l i f pt'*' - ' ^"T''^^^''''''''^''' a a TIIAACA cJblTi alt^eq " 

ciicTdTTC-A A r AC ^ AC AAGG I A ^ CATGTTTT TCCTG ACCCAATTTA ACA n^lCaU.s^ 

CAACTCTTGAACACGACAAGGTATGATGTTTTTCCTGACCCAATTTAACA aSOgjiAl.seq 

GTCTCCGTGCTATCG AG CACATAAGACCTG A ACGCACCACACGCA.fiAAAT Majority 
. f»2 5 ?° 530 . 54 0 ■' 550 

CICTCG GTG'^TAT'CCA r r A r A T A A r A r ^ ^ *' A G G G A G G A G A G G C A C AA AT 2 603_all. S e ( j 
»» li,T C G GTG CT A.TC 0 AG C AC AT A A G A C C T C A AGCC A C C A C A G G C A C A A A T Irt^i ^ a t ™ 

ni^^^5 ?TATCGAGCACATAAGACCTGAAGGCA CCAGACGCAGAAAT cibill all sea 
rl^S^ TGCTATCGAGCACATAAGACCTGAAGG CACCAGAGCCAG nem3 6~a s2 

CTCTCCGTGCTATCGAGCACATAACACCTGAAGGacCAGAGGCAGAUT Sl!^ 

.TGATTTA AT AATTTT A-TCC.AT A A CCTAT GTTATA.GC A CAAACACA C A C TT Majority 
5 ?° - 570 580 590 600 

S TGATTTAATAatttti^ 2603 aii.seo. 

I A I GGA I AAGG I ATGTTATAGG ACAAAGAGAGAGTT ISrsSl.autL, 

?A ir A iiJ AA I A TTTTATCCATAACCTATGTTATAGCACA AAGAGAG ail seT^ 

^ I GA n TAATAATTTTATCCATAACCTATCTTATAGGAGA AAGAGAGAGTT SblU aiHea 

J J ! TAATTTTATCCATAACCTATCTTATACCACA AACAGAG nt 1 

>1 TGATTTA ATA A T T T T A T C C A T A A C.C T A T G T T AT ACCACAAA GAGA GAGTT a^alt!*^ 

TCTATCC AC T.C G A TT AG CT GA AA A T-T T f A C A C .T A A A A A C T AG TCCCACTC Majority 

6 *° 620 - 630 , , 640 650 

ZtZ\ t G GAC tMG A T.T A G C T G A A A AT T T T A ,G A C T A A A A A G T A GTGCCAG t C 2603 ail sea 
- r * J £ a r» J r* ? A T ^ A G C T G A A A A T T T T A.G A C T A A A A. A G'TA G T G C C A G T G St aifsed 

tl iZ^Z f C T C G A TT A G 6 A A .A AT T T T AG A CTAAAAAGT AG T G C ACT G cdM ait se^ 
*I A 'ZZ ^ A C T G «.A TT A G G;T G AAA A T T;T T A C A C T.A A A A A G T A G T G C.C A G "3 lit ait^eq 
rl Ht S £ ^GGATTAGq^G A : ;A>A : A- % T%T A G Av CIA A A A A G T A G t G C C AG ic S^tS 
C T A T 9 A C T Q .G Jfc T T A . G -C T' G .VA^T^ A G A C T A A A A A G T A G T G C C ^ G f ^S^ 

C A Tfl T G A A A G G T A A C A A A AG ACT C T T C C G T T T A A A C G A G A AG ,CC TC TT T Halnrify 
. . . : 660 670 680 690 700 



loo 

69 
101 
!0O 
E01 

m 



so 

19 
51 
SO 
51 
51 



DO 
69 



DO 
31 
01 



£ y A ZlZZ$tttZ GTA * CkkAAGAGTCTrCCeTTTAkACG * G **CCC tH T T i. 2603 ail sea 

? riZZZ 3 r A A A T A C A A A A G A G T.C TTCCGTTTAAACGAGAAG CC.tBttT ISrsit I Heq 

;l G K T T T T G A A A G.G T A A C A A A A G A C T ; C T T CC G T T T A A A C G A G "A A G C C T^T T T coht alt seT^ 

^ r A TTJJ!^" AA l iGG ^ AAGAAAAGAG ^ C ^ T G ^ ^ T cjbnt all ^eq 

! G -A T T T T G A A A G G T A A r a a a ' a r a r ^CCGTTTAAACGA G.A AGCCTGTTT neiiA316Zaii*seq 

a GATTTTGAAAGGTA AC A A A A G A G T0T TCC CTTTA A AC C A CA A CC C TGTTT a909_ail.seq 
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)1 



00 TTAGTCCTTTCAGATCAAGGCAAAAGCACGATCAAATAATTCCCTACCTT 2603 all seq 
i69 TTAGTCCTTTCAGATGAAGGCAAAAGGACGATGAAATAATTCCGTACCTT 18rs2i al H 



„ „ , „ „. » A A T A A T T C C G T A C C T T 18rs21 ail.seq 

01 TTAGTCGTTTCAGATCAAGGCAAAAGCACGATGAAATAATTCCCTACCTT cohl all seq 

00 TTAGTCGTTTCAGATGAAGGCAAAAGGACGATGAAATAATTCCGTACCTT cjblTl ail seq 

01 TTACTCCTTTCAGATCAAG'GCAAAAGGACCATCAAATAATTCCCTACCTT nta316~ai Tseq 
Ol TTAGTCGTTTCAGATGAAGGCAAAAGGACGATCAAATAATTCCGTACCTT a909_a7l .seq 

CATCGATTGCTATGTTACT GCCATCAG G»T C T C A C C A T AT T T A € T A A GAT A Majority 
, 7 ?° 770 780 790 800 

50 C ATGGATTGCTATCTTACTGGCATGAGGTCTCACGATATTTAGTAAGATA 2603 ail seq 
19 CATGGATTGCTATGTT AC TGGCA.TGAGG T.C TCACGATATTTA GTA AG ATA 18rs21_al l!seq 

51 CATGGATTGCTATGTTACTGCC.ATGAGCTCTCACGATATTTAGTAAGATA cohl_ail.seq 

50 CATGGATTGCTATGTTACTGCCATCAGGTCT-CACGATATTTAGTAAGATA cjblll ail seq 

51 CATGGATTGCTATGTTACTG GCATGAGGTCTCAC GATATTTAGTAAG-ATA ne*316 ail seq 
^ CATGGATTGCTATGTTACTG.GCATGAGGTCTCAC.GATATTTAGTAAGATA a9G9_a7l.seq 

TTCGTTt. CAAGATATT CCCACCTATTTTTTAAAGCTTTTAACAAAATCTC Majority 
8*0 820 830 840 850 

TTCGTTTGAAGA-TATTCCCACGTATTTTTTAAAGGTTTTA.AGAAAATGTG 2603 ail seq 

TTCGTTTGAAGATATTCCCACGTATTTTTTAAAGGTTTTAACAAAATGTG 18rs2l allLa 

T T>C G T T T C A A G A T A T T C C C A C G X A T T T T T T A A A G G T T T T A A C A A A A T C T G cohl all seT 

T T C C T T T G.A A G A T A TT C C C A C G T A T T T T. T T A A A G G T T T T A A C A A A A T G T G cjblll ailTseq 

T T C G T T T G A A .G A TATTCCCA CG T A JTTTTT AA AGGTTTT A AGAAAATGTG ne*316 ail.seq 

TTCGTTTCAAGATATTCCCACGTATTTTTTAAAGGTTTTAAGAAAATGTG a909_ail.seq 

TG C T G T C GTA A A A A T GT A AT A A TT TC C C T A CTTC CC C A AT C G CT A C CC C T Majority 
800 . 870 880 890 900 

50 T G C T G T C G T A A A A A T G T A A T A A TTTCGCTACTTCCCCAATC GCTACCCCT 2603_ail.seq 
19 T G G T G TCGTAAA AATGT A. A T A A T T T-C G-C T A C T T C C CC AATCCGT ACCCCT 18rs21_ai 1 seq 

51 TGGTGTCGTAAAAATGTAATAATTTCGCTACTTCCCC A ATCGGT ACCCCT cohl ail.seq 

50 TGGTGTC GTA AAAATGTAATAATTT CG.C TACTTCCCCAATC GCTACCCCT cjbli i_ai Iseq 

51 TGGTGTC GTAAAAA.TGTAA T/A A TTTCGCTACTTCCCCAATCGGT* ACCCCT neo316_all seq 
51 TG GT GT.C G T A A A A AT GT A AT A A TT T C G CTACTTCC CCAATCCGTACCCCT a909_all.seq 

-CTTT GC ACTACf AATTCTCCCTCCTTAATTTTTCCCT T T A CAATATA ACT Majority 
9{Q . 920 930 940 950 

X) CTTTGCAGTAGTAATTGTCCCTCCTTAATTTTTGCCTTTAGAATATAACT 2603 ail.seq 
59 CTTTGCAGTAGTAATTGTCCCTCCT T.A ATTTTTCCCTTTAG AATATAACT 18rs21_ail.seq 
11 CTTTGCAGTAGTAATTGTCCCTCCTTAATTTTTGCCTTTAGAATATAACT cohl ail.seq 
X) CTTTGCAGTAGTAATTGTCCCTCCTTAATTTT. TGCCTTTAGAATATAACT cjbUi_.ail.2eq 
)l CTTT'GC AG. TAG T AATTCTCC CT C'CTTA ATTTTTCCCTTTAG A' A TATA ACT nearfie.all.seq 
CTTTGCAGTAGTAATTGTCCCTCCT T A A TTTTTGCCTTTAGAATATAACT a909_all.seq 



51 



00 
69 
01 
00 
01 
01 




T T C C A A G C A A A T G T C A G A C T A T T T T TTAAAAACTGAGCGTAACTGCCAAT' Majority 

, . 960 . ' : 9 T° ' , 980 , 990 ' - 1000 

» T T GC A AG C A A A T GT CAGAGTATT TT T T A A A A ACT GA GCG T A A GT G CG A A T 2603_ail.seq 
I II ? 5 a 5"* £ A 5 £ 1 ? A ? A ?'I A 1 ?2 I 5? A >* A A •? T G A G CG T A AG.TGGC A A t f8rs2lLaii^ 
» 

ft 

CT G A C A T AT AT A GC T A C.T t-C CCA A TAT C A GAT ACT T T G A G TTTGGAGTAG Majority 

J 1010 1020 . ' , 1030 . . 1040 1050 

m CT.'G AG AT AT AT AC G T A G t T G G.C A A T A T C A GAT ACTTT.GA GTTTGG AG TAG 2603__ail.seq 
59 C.T,C A G AT A T A T A G GT A GT T G G C AATATCAG AT ACTTTG A G.T TTGGAGTAG 18rs2 1.1a ii. seq 
161 X TG A GAT.A TAT AG GT AG T T G G C A AT ATC AGAT ACTTTGA.G TT TCGACTAG cohUail .seq 
KK> HT-G A GAT A T A TA'GG T.A GTTG'GCAATA TC A G AT ACTTTGAGTTTGGAGT AG cjblll_ail seq 
101 C T^G AGATATATA GGTAGTTG GC'AA'TATCAGATACTTTGA GTTTGG AGTAG nem316_a 11. seq 
m CT-GAG AT AT AT A GGT A C T T .C G C A AT A. T C A G A T A CTTT GAGTTTGGAGTAG a909_all.seq 
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J2//^S7 
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Mtgnmen* Report of AM_aBgnment using J. Hein method with Weighted residue weight table. a 

^7iW^JclVMRWc^Z^P,^ — : ^ — — 

AGGTCCTTCTCCACATAA M« UC A ATAC TA T T G T ACATTTCCTGCTTCTC Majority 



1060 1070 1O80 1090 1 100 



1050 AGGTGCTTGTCCACATAATGGAGAATACTATTGTACATTTGCTGCTTGTC 2603 all seq 
B19 ACCTCCTTGTCCACATAATGGACAATACTATTCTACATTTCCTCCTTGTC 18rs21 ailseq 

1051 AGGTGCTTGTCCACATAATGG'AGAATACTATTGTAC ATTTCCTGCTTGTC cohl aTl seq 

1050 A GGTGCTTGTC CACATAATGGAGAATACTATTGTACATTTGCTGCTTGTC cjblll ailseq 

1051 AGGTGGTTGTCCACATAATGGAGAATACTATTGTACATTTGCTGCTTGTC nem316~ail "seq 
1051 AGGTCGTTGTCCACATAATGGAGAATACTATTGTACATTTGCTGCTTGTC a909 ail seq 



A G Ji 



GATGCTCTTATTGGTTAA GG AT TCTGAAAAATCAATAACAGCTCCAC Majority 
n , 10 1120 U30 1140 1150 

— » — ■ I ■ • • 




LISO A GCC AATTC T.T CAAACATCAATAACATCAGCACGCTCT T V C GTTTAAAGCC 2603_ail seq 

J19 ACGG AATTCTTG A AACATCAATA AGATCAGG AGCCTCTTCGTTTAAAGCC 18rs2I_aiKseq 

1151 AGiCGAATTC'TTGAAACATCAATAAGATCAGGAGCCTCTTCCTTTAAAGCG cohUail seq 

1150 AG CCAATTCTTGA A A C ATC A A T A AG AT C AG C A G CCTCTT.C GTTTAAAGCC cjblll_ail seq 

1151 AGCCAATTCTTGAAACATCAATAAGATCACGAGCC T_Ht TCGTTTAAAGCC nem316_ail seq 
1151 AGCCAATTGTTGAAACATCAATAAGATCAGGAGCCTCT T.C G TTTAAAGC C a909_ail.seq 

ATATACTC.CTT TACCAGCGCATAACTTTTAGCCACATCACTATTTTCCTC Majority 
t210 122Q 1230 1240 1250 

1200 A T AT AG T GCTTT A C C ACCCC A'T A AC TTTT A'CCC AC ATC ACT ATT TTCCTC 2603_ail seq 
>69 A T A T A G T C C T T T A C C A G C G C A T A A C T T T T A G C C A C A T C A G T A T t T T C C T C 18rs21_ail seq 

1201 ATATAGT.GCTTTACCAGCGCATAACTTTTACCCACATCACTATTTTCCTC cohi_ail seq 

1200 ATATACTGCTTTACCAGCGCATAACTTTTAGCCACATCAGTATTTtBcTC cjbUl.ail seq 

1201 A T A T A C T G C T T T A C C A G C G C A T A A C T T T T A CCC ACATC AGTATTTTCCTC neo316_all*seq 
1201 A T A T A G T G CT T T A C C A G C G C A T A A C T T T T AG CCACATCAGTATTTTCCTC a909laii.seq 

G A AACTTAATT CT A GTA ATTTTGTT A A GTA AACAACACTTAAGTTCTTTT Majority 
- " 1260 1270 1280 1290 1300 

250 GAAACTT A A TTCTACT A ATTT.TGTTAAGTAA AC AACAGTTAAGTTCTTTT 2603_all seq 
019 GAAACTTAATTCTAGTAATTTTGTTAAGTAAACAACAGTTAA.GTTCTTTT 18rs21__ail seq 

251 G A AACTTA ATTCT AGTAATTTTGTTA AGTAA AC AACAGTTAAGTTCTTTT cohl_.aU seq 

250 G A. AACTTAATTCTAGTAATTTTGTTAAGTAAAC AACAGTTAAGTTCTTTT cjblll__ail seq 

251 GAAACTTAATTCTAGT A ATTTTGTTA AGTAA AC AACAGTTAAGTTCTTTT nem316_ai Kseq 
251 GAAACTTAATTCTAGTAATTTTGTTAAGTAAACAACAGTTAAGTTGTTTT a909_ail.seq 

CAC.CTCTTAGG GCAGGCAT T G A. A C A T G A G G T A A C A C T G G A T G A T G G G A G G Majority 
. ^ 1310 1320 1330 .1340 1350 

300 C AG C T C TT.ACG G C A G G G A TTG A AG A TCA.CCT A AC A CTCGATGATGGGAGG 2603 ail seq 
069 CAJG CT C T T A G CG.CA G G G A T T G A A 0 A T G A G G T A A C A C T G G A T G A T G G G A G G 18rsSi_lail seq 

301 C A.G C T C.T T A G G G C A G G G A T T G A.A.G A T. G A G G.T A AC A.C T G G A T G A T G G G A G G cohi_ali;.seq : 

300 C A G C T C T T AC G G C .A G G G A T. T G A A G A T GAG G T A A C A C T G G A T G. AT G G G A G, G cj b 1 Ml_alt ; seq 

301 C A.C/C T C TT-A GGG C A G G G ATT 6 A AG AT G A G G TAACACT G G AT G A T G G G A G G «ei^i<Gi_ati seqt 
301 C-A GC T CT TAG Q G- CA G G G A T T G A A CATC AG G'TA AC AC TG G Art G A T CGtA ; C G^iliq 

C G A T T A A T T T C TTG CTTTAA C A G TTG A G T C tTACCCA C C T TA A C G A G ATC MaJ or i ty 



350 



1>360 " 1370 1380 1390 1400 

" ~i : 1 : : . l_ 



C G A T T A A T T T C T T C C T T T A A C A G T T G A C T G T T : A C C C A Q C T T A A C G A G. A T C 2603_ail.seq 
119 CGATTAA.TTTCTTG CTTTAAC A G T t G AfiTGTT A CCCAGCTT A ACCAC ATC 18rs21 ail.seq 
351 C G;A T T A A. T T.T C T T GCTTTAACACTTC.AGTGTTAC C C A G CTT'A ACCAC ATC eohl^aTl.seq 

350 CG-ATTAATTTCTTG C T T-T AAC'AGTTGAGTGTTACCCAGCTTAACGAGATC. cJblll^alKseq 

351 C.C A T T A A T T T C T T G C T T T A A C A G T .T G A G T G T T A C C O A G C T T A A C G A G A T C ne«316__ail .seq 
351 CGATTAATTTCTTGCTTT.AACAGTTGAGTGTTACCCAGC-TTAACCAGATC a909_ail.seq 
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# 02 /iff J 

ji™n^«~^^A.< WO 2006/078318 w ' ' ^PCT/US2005/027239 

*gnroent Report of AM_aitgnmern, using J. netn method with Weighted residue weight table. p--- c 

A A T A A T C f^^T^ltf SS^gSI l^A^ ft. C GGTAACTCAAAAGAGTTTT Majority 

I4>0 1420 1430 1440 1450 

400 AATAATGTCATTGAGATCGTTTAAAACAGTGGGTAACTCAAAAGAGTTTT 2603 ail seq 
169 AATAATGTGATTGAG. ATGGTTTAAAACAGTGGGTAACTGAAAAGAGTTTT 18rs21_all seq 

401 AATAATGTGATTGAGATGGTTTAAAACAGTGGGTAACTCAAAAGAGTTTT cohl_ail seq 

400 AATAATGTGATTGAGATGGTTTAAAACACTGGGTAACTGAAAAGAGTTTT cjblll_ail seq 

401 AATAATGTGATTGAGATGCTTTAAAACAGTGGGTAACTGAAAAGAGTTTT ne*316_ail seq 
401 AATAATGTGATTGAG A T.G GTTTAAAACACTCCCTAACTGAAAACAGTTTT a909_all.seq 

TCtTAGTATGTTTTACGTGAACAACAATATCAGGATCCGCAACAATCTGT Majority 
- 1460^ 1470 1480 . 1490 1500 

450 T C T T A G T A T G T TT TAGGTGAAGAACAATATCAGGAT C.C GCAACAATCTGT 2603_ail seq 
219 TCTTACTATCTTTTAGGTGAAGAACAATATCAGGATCCGCAACAATCTGT 18rs2i auleq 

45 1 TCTTAGTATGTTTTAGGTGAAGAACAATATCAGGATCCGCAACAATCTGT cohl_aiI.seq 
TCTTAGTATGTTTTAGGTGAAGAACAATATCAGGATCC GCAACAATCTGT cjbm.ailseq 
TCTTACTATCTTTTAGCTGAAGAACAATATCAGGATCCGCAACAATCTCT nea31G_.aU .seq 
TCTTAGTATGT T TT A G GTC A AC A ACA AT ATCACC AT'CCCC A A CA ATCTG T a909^_all.seq 

TCTGACTCTTC TAATAAATGATTCATGGCTTGTTGGCAACTAGCCTCAAA Majority 
. 15101 1520 1530 1540 1550 

500 TCTGACTCTTCTAATAAATGATTGATGllCTTGTTGGCAACTAGCCTCAAA 2603_ail seq 
269 TCTG ACTCTTCTAATAAATGATTGAT Gy C TTGTTGGCAACTAGCCTCAAA 18rs21_ail.seq 

501 TCTG ACTCTTC'TAATAAATG ATTGAT cflc TTGTTGGCAACTAGCCTCAAA cohl_all.seq 
SCO TCTGACTCTT.CTAATAAATGATTCATGGCTTGTTGGCAACTACCCTCAAA cjblll all seq 
501 T C.T G A C.T C -T.T CTAA.TAAATG ATT G A TGGCTTGTTGGCAACTAGCCTCA AA riea316lall"seq 
501 rTCTGACTCTTCTAATAA.ATGATTGATGGCTTGTTGGCAACTAGCCTCAAA a909_all.seq 

CTGTC TTTGGAAAAAGGCATCGATAGACACAAGAAGACTACGTATACTGG Majority 

1560 157Q 1580 1590 1600 

550 CTGTGTTTGGAAAAAGGCATCGATAGACACAAGAAGACTACGTATACTGG 2603_ail.seq 
319 CTGTGTTTCGAAAAAGGCATCGATAGACACAAGAAGACTACGTATACTGG 18rs21_ail.seq 
551 CTGTGTTTGGAAAAAGGCATCGATAGACACAAGAAGACTACGTATACTGG cohl_ail.seq 
550 CTGTGTTTGGAAAAAGGCATCGATAGACACAACAAGACTACGTATACTGG cjblll_ail.seq 
551 CTGTGTTTGGAAAAAGGCATCGATAGACACAAGAAGACTACGTATACTGG nem316_ail.seq 
551 CTCTCTTTGGAAAAAGGCATCCATAGACACAAGAAGACTACGTATACTCG a909_ail.seq 

TACTACCAAAA CAAGGCACAAGCTTTATATAGGATAACATTTCTTTTTTA Majority 
1610 1620 1630 1640 1650 






600 TAGTAGGAAAACAAGGGACAAGCTTTATATAGGATAAGATTTCTTTTTTA 2603_ail.seq 
369 TACTAGGAAAACAAGGGACAAGCTTTATATAGGATAAGATTTCTTTTTTA 18rs21_ail.seq 
601 TAGTAGGAAAACAAGGCACAAGCTTTATATAGGATAAGATTTCTTTTTTA cohl_ail.seq 
600 TAGTAGCAAAACAAGGGACAAGCTTTATATAGGATAAGATTTCTTTTTTA cjblll_ail.seq 
601 TAGTAGGAAAACAAGGGACAAGCTTTATATAGGATAAGATTTCTTTTTTA nem316_ail.seq 
601 TAGTAGGAAAACAAGCGACAAGCTTTATATAGGATAAGATTTCTTTTTTA a909_ail.seq 

CTA CGATGAC A A AATTGTTCTACAAA GC GA CTGG ATAACTGTTCTTGC CT Majority 

1660 1670 1680 1690 1700 

150 H T A C GAT G AG A A A A T T G T T C T. A G A A AG C G A C.T G G ATA A C T GTTCTTGCCT 2603_ail.seq 
119 G T.A C G A T G A GA A A A T T CT T C T A G A A. A G C C A G T G G A T A A C T.G f T C T T GC C T isf^lju'l seq 
iSt C T A C G A T G A G A A A A T T G T T C T A G AAA G C G A C T G G .A T A A C T C TT.C T: T G C C T cohl_ail seq 

150 CT AC G A TG AG A AAA T T G T T C T A. G A. A A GC G - A. C T. G G" A T. 'A A .C T G T -T C T. t. G C C T *cj bVML&ki seq 
\S1 C T-ACftH GA.fc A A A A T T G T T- C T- A; G- A A A; G* C G A C T G G A t A A C TVC^Tft^C 'T;T 'G5"0 'T^'i^ie^^seq^ 

151 0 T A C G A T G A G A A A A T T G f T C T ?;A C aJl^ G C G A C *' : G- 1 «;- , A-¥--a;a\.C;T.^ ^.T^^-117^ j'e^^i|.&gVv 

ATT CA T At CAGGGC TAT AG G GA T "A AAA T CGTC G A: AT A:G C A A T A A >G I; A T A T;T Majority . 
1710 1720 1730 1740 1750 

00 A T T G A T A T C A G G G C T A T A G G G A T A A A A T G G T C C A A T A C C A ATA A G A T A T T 2603iail.seq 
«9 ATT GATATCACGGCTAT A G G G AT A A A A T G G T G C A A T A G C'A ATAAG AT AT T 18rs21_aU.seq 

01 A T T G A T A T C A G G G C T A T A G G G A T A A A A T G« T C C A A T A G C A A T A A G A T A T T eohLail.seq 
-00 ATTGATATCAGGGCTATAGGGATAAAATGGTCCAATAGCAATAAGATATT cjblll_aU.seq 
Oi A T.T G A T A T C A G G G C T A T A G G G A T A A A A T G G T C C A A T A G C A A T A A G A T A T T Aea316_ail.seq 
'Ol A'T TGATATCAGGG CT AT A GGG AT AAA AT GGTCCAA T-A GCAATAAG.ATAT'T a909_all seq 

CTGATATCTTG'GTTATAAGGGA TACC T AA ' A TGTTTTAAA'^ACTGATACTA 
1810 1820 1830 1840 1850 

sot ctcaiatcttccttaJaIccc ^5^1 M^H TTAAAUCTCATAG - tA »8wa|_Ml!3_, 

800 CTCAIATCTTCCTIATAAtCCATlrrTiiiTr^ cohl_al 1 . seq 

ACCCAACACATAGTCTTCGT T- A CCAT A TA ACTCAACCACtTCCTTCTCTr „a Jorlty 
1860 1870 1880 1890 1900 

S ^ITc^TaTa^^ 

85, A C G C'A A C A G A TAGTCXTCCTTA C__A L^lcJf' ^ A ^ T Ir C ^ TCTCTC 18rsZ,_a„. S eq 

85. ^caa^a^c 0 ^?^^^^ 

CT'GACATCACTGAA'ATAG-G TAGT TG AG ATATCGTATGCAATGTTTGA A f A Uajorlty 
1910 1920 1930 1940 1950 

», : ?§ ax : tn r. ; ; ; ; : : j ; k ; ;j j ; ? ; ? : « « « • ; .: * s J ssssi. 






tCT.TTAAAATC€AATCTAACCATTTCATACACCr. r. CTTCATTATCATTTC Majority 



1960 1970 1980 1990 2000 

^^3f AAATCGAATGYAACCATTTGAT A GACCGCCTTclTTATCATTTC ?fi ^ t1 

.5, TCTTTAAAATCCAATCTAACCATTT^A^ctcc*^ 

tAGAATTTTTCTTTAGr.TTTGTAAAOACTACAAA A T A A A A T G A T.r. A A g AC HaJ r 

. .__ ■ ■ ■ : 20>0 . ^ 

E T A G A S T T T I C X Vi A C G J ?•? J ? _ M f _ _ H S ^ * T A A A W G.A X G A A A A C ZGO^aiL.seq 
»1 TAG AilTT II Ct lit Igt •? t£ ? A A ff! S I *'£•*'* * A T * f* W ATP AAA A C , ____fc4ifc3_ T 

x ag aa T rr,mm mm^msmmimunrm^^-. 

AAMAT C XT G T C C A' T-A C A CT A t A A- A G A C A C C. C T A A T. T A G C- A A A T. T h -T ri t* r Majority 

- A A C T A'¥ .C T T G T.G C A ? A C A C T A A A A A C A C A C G C T A A T T A C C A A A C T C T C ^ I8rs2, aU^seq 

- A AC T A T C T X C.T G G A T AC AC TAAAAAGACA6CC T A A ^ T A* G ^ £ A AC^CTC? S^nlail^eq 1 
AACTATCTT C T G G- A X A C A C T. A A A A A G AC ACGCXAATTA'G C AA ACTCTCTC ^to^all's^' 



Alignment Report of AI-1 WO 2006/078318 PCT/US2005/027239 

Thursday, July 29, 2004 5:45 PM 

TTCATCATCTCTCACCATTATTATACTACTATTTATATGACAAATAAACC Majority 

2110 2120 2130 2140 2150 

2100 TTCATCATCTCTCACCATTATTATACTACTATTTATATGACAAATAAAGG 2603_ail.seq 
1869 TTCATCATCTCTCACCATTATTATACTACTATTTATATGACAAATAAAGG 18rs21_ail.seq 

2101 TTCATCATCTCTCACCATTATTATACTACTATTTATATGACAAATAAAGG cohl_ail.seq 

2100 TTCATCATCTCTCACCATTATTATACTACTATTTATATGACAAATAAAGC cjblll_ail.seq 

2101 TTCATCATCTCTCACCATTATTATACTACTATTTATATGACAAATAAAGC nem316_ail.seq 
2101 TTCATCATCTCTCACCATTATTATACTACTATTTATATGACAAATAAAGG a909_ail.seq 

TGATTTTCTTAAAAATATAACTTTCAAA AT CCAC AT AT ATTTTT AATCTT Majority 
2160 2170 2180 2190 2200 




5150 T G AT TTTCTTAAAAlTATAACTTTCAAAATCCACATATATTTTTAATCTT 2603_ail.seq 

1919 TGATTTTGTTAAAABtATAACTTTGAAAATCCACATATATTTTTAATCTT 18rs2l_ail.seq 

2151 Til A TTTTGTTAAA AH TAT A A CT TTGAAA ATCCACA-TATATTT'TTAATCTT cohl_ail.seq 

2150 TCATTTTGTTAAAAATATAACTTTGAAAATCCACATATATTTTTAATCTT cjblll_ail.seq 
2151 TGATTTTGTTAAAAATATAACTTTGAAAATCCACATATATTTTTAATCTT nem316_ail.seq 

2151 TCATTTTGTTAAAAATATAACTTTGAAAATCCACATATATTTTTAATCTT a909_ail.seq 

C CC TCTGAAAAAA. > T A A AT A A A A AT ACT A A A AAT A A AC A C G A ATTT A A A A Majority 
2210 2220 2230 2240 2250 


U99 CC GTCTGAAAAAA-TAAATAAAAATAGTAAAAATAAACACGAATTTAAAA 2603_.all.seq 

1968 CCGTCTGAAAAAA - T A A A T A A A A A T A G T A A A A ATA A A C A C G A AT TT AA A A 18rs21_ail.seq 

1200 CCGTCTGAAAAAA - TA A ATAAAAAT A' GTAAAAATAAA.CACGAATTTAAAA cohl_ail.seq 

200 CCCTCTGAAAAA A - T AAATAAA AATAGTAAAAATA A. AC ACGAATTTAAAA cjbUl_aii.seq 

5201 CCCTCTGAAAAA aQt A A ATAAAAAT AG T A A A A AT A A A C A CCA AT TT A A A A nen316_.ail.seq 

5201 CCGTCTGAAAAAA - TAA A TAAAA ATA G T A A A A A TA A A C A CG A ATTtA AAA a909„ail.seq 



T A A C C A A A T TT TTT A AC A A A A TCTCT C CT A A ACTTT A AT ACTt t TCTCCT Majority 
2260 . 2270 2280 2290 2300 



248 TAAGCA A ATT. T TTT A AGA A A ATCTGTGCTAAACTT. TAATAGTTTTGTGCT 2603_aU.seq 
»17 T A A G.C.A A ATTTTTTA AGAAAATCTGTGCTA A AC TTTA'ATACTTTTGTCCT 18rs2i_ail.seq 

249 TAAGC A A A TTTTTTAAG AAA. ATCTGTGCTAAACTT TAATAGTTT T G T G CT cohi_ail.seq 
249^ TAAGCAAAT. TTTTTAAGAAAATCTGTGCTAAACTTTAATAGTTTTGTGCT cjbIU_ail.seq 
251 T A A G C A A AT T T t T T A A G A A A A T C TGTGCTAAACTT TAATAGTTTTGTGCT nem316_ail.seq 

250 TAAGCAAATTTTTT A A GAAAATCTGTGCTAAACTTTAATAGTTTTGTGCT a909_ail.seq 

T A A TA ATAATCAGCACTTACAAAGAA C A A A G G G A A A A G C G AG GACAGAAC Majority 

2310 2320 2330 2340 2350 

2298 TAATAATAATCAGCACTTACAAAGAACAAAGGGAAAAGCGAGGAGAGAAC 2603_ail.seq 
2067 TAATAATAATCAGCACTTACAAAGAACAAAGGGAAAAGCGAGGAGAGAAC 18rs21_ail.seq 

2299 TAATAATAATCAGCACTTACAAAGAACAAAGGGAAAAGCCAGGAGAGAAC cohl_ail.seq 

2299 TAATAATAATCAGCACTTACAAAGAACAAAGGGAAAAGCGAGGAGAGAAC cjblll_ail.seq 
2301 TAATAATAATCAGCACTTACAAAGAACAAAGGGAAAAGCGAGGAGAGAAC nem316_ail.seq 

2300 TAATAATAATCAGCACTTACAAAGAACAAAGGGAAAAGCGAGCAGAGAAC a909_ail.seq 

TTTTAATGAAATTATCGAAGAACTTATTCTTTT CG G C TGCTGTTTTAACA Majority 
2360 2370 2380 2390 2400 

348 TTT TA ATCA A ATTATCG A AG A A G*.T'T ATT CTT.TTCGG C T G CT G TTT T A A C A 2603_ail.seq . 
117. T'TTTA A T;G A A A T TAT CC A A G A A G TT A T-T G T TT T C G G ^ T.G Qf G t %T T A A C A 18r^l_.aii.ieq-- 

349 TTT T A A TG A A A T TAT c'CAAG A...A C T/.1ft :A. T T 'h t\t t T h In. r. r T „ n i?' r. t.t *r t a A r .'a ^i-oM <w, 



349 T.TTTA AT'GA A AT f ATCG A AG A-A. G 

349 T TTtAATG A A A T -T A. T C G A A G A A G 
351 T TTT A ATG .AjAiA f : T A f G £ MA gX aJg 

350 TI T T A. A.:t s G'^. AitT^T^G % AVA--G :&&lGft 

ATG G T G G C G G G G T C A A : C T. G. T, T-C; A A C C A G: : T 1 A 'G, CT. C A C^ T?T. T : G- C G ACT G. G. A' A^*fc! Majority 
2410 2420 ' 2430 2440 2450 


398 ATCGTGG C G G G G T C A ACTGT TG A X CCAC TA G CTCAC TTTGC GA.C T G G A A f 2603_ail.seq 
167 ATG GTCCCGQG GTCAAC T GT T G A. A C C ACTA GCTC AGTTf CGG ACTGC A AT 18rs21_.all.seq . 

399 A T G G T C G.CG'G GCTC A AC'TGTTC AAC C ACTAGCTCAGTTT GCGACT G'G A A T cbhi_ail.seq 

399 A TBCT G G C.G G. G G T C A A C T G T T C A A C C A G T A G C T C A G T T T G C G A C T G G A A T cjblli_ail.seq 
401 AT G G TCG'CGCGCT C A ACTGTTGAACCAGT AGCTCAGTTT CCG ACTGC A AT. nem316_ail.seq 

400 A tHg T G G CG G G G T-C .A A C T G TTGAACCAGTA GC TCAGTTTG CCAC T G G A A T a909_ail .seq. 
WO 2006/078318 PCT/US2005/027239 

Alignment Report of AI-1, using Clustal method with Weighted residue weight table. 
Thursday, July 29, 2004 5:46 PM 

C A G T A T TjHTitA Tgt TOj ttjgg f g f ftgg AAGA ACCCC C ACCGA AAACAA Majority ! 

2460 2470 2480 2490 2500 

J^3 ATTGTAA «AGCTGCAGAAGTCTCACAAGAACCCC'CACCGAAAACAA 18rs2I ailsea 

2449 G.A G T A T T GTAAGAGCTGCAGAAGTGTCACAAGAACGCCCAGCGAAAACAA ciblH ail sea 

H% GAGTATTrTAArArrxr GA S ^ 

2450 CAGTATTCTAAGAGCTCCAGAAGTCTCACAACAACCCCCACCGAAAACAA aOOS^aTl.^eq 

CAGTAAA TATCTA TAAATTACAAGCTGATAGTTATAAATCGGAAATTACT Majority 
2510 2520 2530 2540 2550 

^7 C-AC? A A f I Ylfl A .l f AATT 'ACAAGCTGATUTTATAAATUCAAATTAci 2603.all.seq 
5267 CA CT A A A T A T C T A T A A A T.T A C A A C C T G AT A G TT A T A A A T C G G A A A T T A C T 18rs21 ail s^n 
moo CAG T AAAT - ATCTA TAAATTACAAGCTCATACTTATAA A T C C G A 1 A ^ ^ f 

CAT? tZrltl AAATTACAAGCTC ATAGTTATAA ATCGCAAATTACT cjb^aiHeq 

S AG I AAA I A J^ ATAAATTAC AAGCTGATAGTTATAAATCGGAAATTACT *Lsi6 all.sSi 
1500 CAGTAAATATCTATAAATTACAACCTGATACTTATAAATCGGAAATTACT a909_alKseq 

TCfAATGCTGGTATCGAGA ATAAAG A C G G C G A A C T A A T A TCTAACTATGC Majority 
2560 2570 2580 2590 2600 

SJ5 ZZZi tl TG G T A T C G A CAATAAAGACGGCGAACTAATATCTAACTATGC 2603 aii.seq 

317 TCTAATGGTGGTATCGAGAATAAAGACGGCGAAGTAATATCTAACTATGC 18rs21 ail leo 
22 t"#» ^AATGGTGG T A TCGACAATAAAGACGGCCAAGTAAT A TCTAACTATGC alt seT* 

?r^x^ A 'T^^ TG ^ GTATCGAGAATAA ACAC CGCGA A GTAAT A TCTAACTATGC cjblll ait^eq 

to tV Z Mx^ G ^ GTATCGUUTAAACACGCCCAUTAATATCT A A. CTATCC ni.316~all.35 

550 TCTAATCGTGGTATCGACAATAAAGACGGCGA.AGTAATATCTAACTATCC a909„a7l.seq 

TAAACTT GGTG A C A AT GT A A A AC GTTTGCAACCTGT A C A C T T T A A ACGTT Majority 

2610 2620 2630 2640 2650 

2! 1 A A A £ TTGG. TGACAATG TAAAAGGTTTGCAAGGTG-TACAGTTTAA A.C G T T 2603 aii.seq 
-I I f i * ^ T T C G T G A C A A T G T A A A A C G T T T G C A.A G G T G T A C A G T T T A A A C C T.T 18rs5l all^eq 
22 «r AAA ^"X^ GGTG ACAATGTAAAAGGTTTGCAAGGTGTA CA. GTTTAAACCTT cohl all. seq 

r AAA ^J. G S TGACAATGTAAA AGGTTTGCAAGGTGTACAGTTTAA ACGTT cjblll ail^eq 
^ > A * * * 3 3 G ° T G A C A A T G T A A AAGGTTTGCAAGGTGTACA.GTTTAAACCTT ni*316~ail.sS 
BOO TAAACTTGGTGACAATCTAAAACGTTTGCAA G GT CTACAGTTTAAACCTT a909_a7KseT^ 

ATAA ACT C A AG ACGG ATA T.TTCTGTTGATGAATTGAAAAAATTGACAACA Majority 
2660 2670 2680 2690 2700 

^5 A 3 A M^IS AAGACGGATATTTCTGTTGATGAATTGA aii.seq 
£ A? A A trJn t f S AGGGATATTTCTGTTGATGAA TTCA AAA AATTG ACAAC-A ISrs^aiHeq 

J MS^ J T T ! TTTCTGTTGATGAATTGAAAAAATTGACAACA cohl_ail.seq 

A 3 AAAG 3 GAAGACGGATATTTC TGTTGATGAATTGAAAAAATTGACAACA cjblll ailTseq 

A J AAAG I GAAGACGGA TATTTCTGTTGATGAATTGAAAAAATTGACAACA ne*3 16~an .sea^ 

.50 A T A A A G T C A A G A C G G A T A T T T C T G T T G A T G A A T T G A A A A A A T T G A C A A C. A a9Q9.laH.seq 

G T T G A A G C A C C A G A T G C A A A A G T T G G A A C G, A TTCTTGA A.C. A AG GTCTC AG Maloritv 

2710 2720 2730 2740 2750 

J 9 ? C T T G A A G C A G C A G A T G C-A A A A G T T G G A A C G A T.T C T T G A AG A A GGTCTC A. G 2603 ail, seq 
SI nZZi *'t f 9 V^ C A G A J G C A A <3 T 5 G G A A C G A TT CTTCA AG A AG G T GTCA C 18rs2l - al l Tseq 
£2 °. ZZ I t A - G ^P;-A ^.C.*A G A, T G C A A -A A';.G'.T T- GIG A A G;G A T XG T-T G. A A G A A G C T G t C A G ^coht iti V leq 
™ r Z Z 5' Z?M:*'?>$$ ^ 'H :A - ^ T T G ^ A ^ e <?. A T T e- T j G A AT G A A G G T G T C A G cjfelil ail^eq 

^ $ZZ ^'f5v : A:^ A '' € 'Al G ,c a a:a a g : t gg a a c g a t t c t t g a a g a a gg # c^r c AG i^fei!'^ 

rOO G T T G A,A G^C A G;G:A : G^T/G W*W**'CTte:3fT*!r'.t'tf A A G A Afi,G-TGT C AJg a909 all seq 



; t Ct A aiX A. A A>AAi ACT A ^A ^G C fcC A AG GTTT G GT C G T C C A^f GC t C TG G A T T Maloritv 

2760 2770 2780 2790 2800 

!f! I C T AGC T C A A A A A A' "C T A* A T G C T CAAGGT TTGG T CGTC GAtG.CTCTG.GAT T 2603 all seq 

III W * A C C T C A A A A A ACTA A T.G C T CAAGGTTTGGTCGTCGATGCTCTG.GAT T ISrsIl ail seq 

it ZZZ t° C T C A A A A A A C t A A T G C T C A A G GT T T G G T C G T C G A T G C T C T G C A T T cohl ail,seq 

49 T C T AC CTCAAAAAACTAATG C'T CAAGGTTTGGTCGTCGATGCTCT GG ATT cjblll ail sea 
^ I ^ I A G G T C A A A A A A C T A A T G C T.C A A G G T T T G G TC G T C G A T G C T C T G G A T T nem316 aiiisS 

50 T C T A C C T C- AAA A A AC T A At G CT C A AC GT TTGG TC GTC G A T G CTCTG C ATT a909 aU seq 

CAAAAAGTAATGTGAGATACTTGTATGTAGAAGATTTAAAGAATTCACCT Majority 
2810 2820 2830 2840 2850 

2798 C A A A A A C T A A T G T G A C A T A C T T C T A T G T A C A A C A T T T A A A G A A T T C A C C T 2603_ail seq 
2567 CAAAAAGTAATCTGACATACTTCTATGTACAAGATTTAAAGAATTCACCT 18rs2l_ail seq 

2799 CAAAAACTAATGTCAGATACTTCTATGTACAAGATTTAAAGAATTCACCT cohl_aii seq 

2799 CAAAAAGTAATGTGAGATACTTGTATCTAGAAGATTTAAAGAATTCACCT cjbll l_ail seq 
2801 CAAAAAGTAATGTGAGATACTTGTATGTAGAAGATTTAAAGAATTCACCT nem316_ail seq 

2800 CAAAAAGTAATGTGAGATACTTGTATGT A X *G AAGATTTAAAGAATTCACCT a909_ail.seq 

TCAAA CATTACCAAAGCTTATGCTGTACGGTTTGTGTTGGAATTACCAGT Majority 
2860 2870 2880 2890 2900 

TCAAACATTACCAAAGCTTATGCTGTACCGTTTGTGTTGGAATTACCAGT 2603_ail seq 
TCAAACATTACCAAA.GCTTATGCTGTACCGTTTGTGTTGGAATTACCAGT 18rs21_ail .seq 
TCAA ACATTACC AAAGCTTATGCTGTACC-GTTTGTGTTGGAATTACCAGT cohl_aiI seq 
TCAAACATTACCAAAGCTTATGCTGTACCGTTTGTGTTCGAATTACCAGT cjblll_ail seq 
TCAAACATTACCAAAGCTTATGCTGTACCGTTT.GTGTTG.GAATTACCAGT nem316_ail seq 
TCAAACATTACCAAAGCTTAT G C TGTACCGTTTGTGTTGGAATTACCAGT a909_aU.seq 

TGCTAA.CTCTAC ACGTACAGCTTTCCTTTCT C A AATTAATATTTACCCTA Majority 
2910 2920 2930 2940 2950 

5898 TGCTAACTCTACAGGTACAGGTTTCCTTTC T.G AAATTAATATTTACCCT A 2603_ail seq 
5667 TGCTAACTCTACAGG-TACAGGTTTCCTTTCTGA A A TTAATATTTACCCTA 18rs21_aii seq 

5899 TGCTAACTCTACAGGTACAGGTTTCCTTTCTGAAATTAATATTTACCCTA cohl_aiI seq 
5899. TGCTAAC T-C TACA. CGTACAGGTTTCCTTTCTGAAATTAATATTTA C-C C T A cjblil.ail seq 
5901 TCCTAACTCTAC A G GT A C A G'C.TTTC CTTTCTGAA ATT AATATTTACCCTA ne*316_ail*seq 

5900 TCCTAACTCTAC AG G T A CHCTTTCCTTTCTGA AATTAATATTTACCCTA a909_a i 1 . seq 

AAAAC GTTGTAACTCATGAACCAAAAACACATAAAGATGTTAAAAAATTA Majority 
2960 2970 2980 2990 3000 

948 AAAACGTTGTAACTGATGAACCAAAAA C A GATAAAGATGTTA AAAAATTA 2603__ail seq 
J717 A A A A C C T T G T A A C T G A T G A A C C A A A A A C A G. A T A A A G A T G T T A A AB A A T T A 18rs211ail seq 

949 AAAACGTTGTAACTGATGAACCAAAAACAGATAAAGATGTTAAAAAATTA cohl_aii seq 

949 A A A AC G T T G T- A A C T G A TG A A C C A A A A A C A GAT A. AAGATG'TTA AAAAATTA cjblll_aii seq 
951 AAAACGTTGTAA CTGATGAACCAAAA ACAGATAA-AGATGTTAAAAAA-TTA nem316_ail,seq 

950 A A A A C C T T GTAA CTGATGAACCAAAA A' CAGATA A AG AT GTTA AAAAATTA a909__aiKseq 

CCTC ACCACCA T GCAGGTTATACGATTGCTGAAGAATTCAA ATGGTTCTT Majority 
3010 3020 3030 3040 3050 

GGTC.AGGACGATGC AGGTTATACGATTGGTCAAGAATTCAAATGGTTCTT 2603__aii seq 
GGTCAGG'ACGATGCAGGTTATACGATTGGTGAAGAATTCAAATGGTTCTT 18rs2i_aii seq 
GGTCAGGACGATGCAGGTTATACGATTGGTGAAGAATTCAAATGGfTCTT cohl ail. seq 
GGTCAGGACGATGCAGCTTATACGATTGGTGAAGAATTCAAATGGTTCTT cjblll_aliseq 
G. GTCAGGACGATGCAGGTTATACGATTGGTGAAGAATTC A A ATGGTTCTT nem316_ail seq 
GGTCAGGACGATGCAGGTTATACCATTGGTGAAGAATTCAAATGGTTCTT a909_aiKseq 

G A A AT C T A C A A T C C C T G C C A AT T T A G G T G A C T A T G A A A A A T T T G A A A T T A Majority 

3060 3070 3080 . 3090 3100 

Q48 G A A A T C TAX A AT C, C C TG CCA ATTTAG.GT G ACTATG A AAAATTTGA A A-TTA 2603_ail.seq 
W G A A A T CI AC A A T C C C T G CCAATT T;A G G T G A-C f A T G A A A A A T T T G A A A T T A 18rs2liaU.seq 
349 G A A A T C T A C A A T CCX TG C C A A T T T A G G T G AC TAT G. A A A A A T T'TG A A A T T A c^hl-ail.seq 

349 G A A AT C TACA A t C CtT G.G C A A f TT A G GT G A C T A T G A A A A A T T .T G A A A T T A cjbilliialt seq 
tel;.. A AT C TACAA T C C C TJG.C C.AM- T T A <3 <j T-G^AvC T '>T G A A A>A A TTT G A. A A T T A ne^lfe^il^ 

350 G A A A T C T A G A A T .C C C.T G C C A A T T ;T ;A> G\ G t G TYA' T G • A, A AAA TT T G A A A T t A ha6£i^ja^-' 

' C ^ CAT AAA T T T G C A G A T G CX T T G A C • T T. A T A A A f C T ;C T T G G A A A A A T C A A G Majority 
3110 3120 3130 3140 3150 






)98 C T C AT A A ATTTG C A-G AT GGCTTGACTTATAAAT.CT GTT GG A AAA A TCAA G~ 2603_ail.seq 

367 CTCATAAATTTG C A G.ATGG. CTTGACTTATAAATCTGTTG G A A AAATCAAG 18rs2Uall seq 

599. C TG A T A A AT T T G C A G AT G G C T.T G AC TT AT A A AT C T GT TG G A A A A A T C.A A G GohtL.all.seq. 

J99 CT"GATAAATTTGCAGATGGCTTGACTTATAAATCTGTTGGAAAAATCAAG ciblll^ail seq 

101 CTGATAAATTTGCAGATCGCTTGA.CTTATAAATCTGTTGGAAAAATCAAG nem316^ail.seq 

100 C.Tfi ATA AATTTG GA G AT-G G'CTTGACTTATAAATC TG TTGGAAAAATCAAG a909„aii.seq 

A T T C C T T <pn pjSSjgQ jjgg T ^g g El » CACTAC ACT ATTCATCAACC "Majority 

3160 3170 3180 3190 3200 

"« A Z Z 9, rTZ^!?^ AA ACACTCAATACACATC AC C ACT ACACTATTCATGAACC 2603 ail.seq 
M !II^ TTCGAAAACACTGAATACAGATCAC CACTACACTATTCATCAACC 18rs21 allien 
\ll I 5r CGAAAACACTGAATAGAGATGAGCACTACACTATT GATCAACCcoh ill s^T 

I CTTCCAAAACACTCAATAGAGATCAC CACTACACTATTCATGAACC cjblll aiHeq 
51 ATTC GTTCGAAAACACTGAATAGAGA T G AGCACTACACTATTGATGAACC nem316~aii *s2 
150 ATTCGTTCGAAAACACTGAATACACATGACCACTACACTATTGATGAACC a909_a7l .seq 

AACAGTTCATAACCAAAATA CATTAAAAATTACCTTTAAACCAGAGAAAT Majority 
3210 3220 3230 3240 3250 

V™ AA C A CTTCATAACCAAAATACATTAAAAATTACGTTTAAACCACACAAAT 2603 ail.seq 
^1 A a n a ^T»-ir^^ T^AACCAAAATAC AXXA AAAATTACGTTTAAACCAGAGA.AAT 18rs2i ail^eq 
I™ AA C A GTTGATAACCAAAATACATTAAAAATTACGTTTAAACCAGAGAAATcohl ail.seq 

AA J^n^ ATAACCAAAATACATTAAAAATTACCTTTAAACCAGAGAAA T ejblTl all?seq 
™ A^^r*^^T^^A^T A r* AAAATACATTAAAAATTACCTTTAAACCA G'A C A A A T ne^lelaii .seq^ 
200 AACAGTTGATAA CCAAAATACATTAAAAATTACCTTTAAACC AC AG A A AT a909_a7l.seq 

TTAAACAAATTGCTGAGCT ACTTAAAGGAATGACCCTTGTTAAAAATCAA Majority 
3260 3270 3280 3290 3300 

111 TJAAACA AATTCCTCAGCTACTTAAACCAATCACCCTTGTTAAAAATCAA 2603 ail.seq 

Ibl II. AAAGA AATTGCTGAGCTACTTAAAGQAATGACCCTTCTTAAAAATCAA 18rs2I all /seq 

249 TTA.AACAAATTCCTGAGCTACTTAAAGGAATGACCCTTGTTAAAAATCAA cohl ail.seq 

S49 TTAAAC.AAATTGCTGACCTACTTAAAGGAATGACCCTTGTTAAAAATCAA cjbl7i_all Iseq 

^ IJ AAAC AAATTCCTGACCTA.CTTAAACCAATGACCCTTGTTAAAAATCAA ne*3I6 ail.seq 

ISO TTAAAGAAATTGCTCAGCTAGTTA.AAGGAATGACCCTTGTTAAAAATCAA a909_a7i.seq 

GATGCTCTTGATAAAGCTACTGCAAATACAGA T C A T CCCCCATTTTTQCA Majority 

3310 3320 3330 3340 3350 

!98 GATGCTCTTCAT A A AC CTACTGCAAATACACATCATGCGGCATTTTTGGA 2603 ail seq 
*7 GATGCTCTT-GATAAAGCTAG.TGCAAATACA. GATGATGCGGCATTTTTGGA 18rs21 ailTseq 

^ A J^ CTCTTG AT AAAGCTACTG.C A A AT AG A GATG ATGCGGC ATTTTTGGA cohl ail. seq 
£9 GATGGTCTT G A TAAACCTACTG C.A A A T A C A GAT C A T.G C G G C A T T- T T T G G A - ejblTl ailTseq 
^ • GA TCCTCTTGA.TAAAGCTACTGeA.AATA.CAGA.TGATGCGGCArrTTTCGA nem316~ail.seq 
100 CATGCTCTTC AT A AAGCT A CTGCAAATAC A GATG ATGCGGC ATTTTTGGA a909_a7l.seq 

A ATTCC AC T'T CCATCAACT ATTAATCA.AAAAGCACTTTTACCAAAAGCAA Majority 
3360 3370 3380 3390 3400 

548 AATTCCACTfGCATCAACTATTAATGAAAAACCAGTTTTACGAAAAGCAA 2603 ail.seq 
aI AA 3I^ AGTTGCATCAACTATTAATGAAAAAGCAGTTTTAGG I8rs21 ail^eq 

1o AA TI^ A ^ TTGCATCAACTATTAATG AAAAAGCAGTTTTAGGAAAAGCAAcohl ail.seq 

A J TGCAGTTG CATCAACTAT.TAATGAAAAAGeACTTTTACGAAAAGCAA cjblll all!seq 
f n AA TTCCAGTTGCATCAAGTATTAATGAAAAAGCAGTTTTAGGAAAAGCAA ne^ielai 1 .seq 
SO AATTCCAGTTGCATCAACTA.TTAATGAAAAAGCAGTTTTAGGAAAAGCA. A a909_ail.seq 

,T T G A A A A T A C T T IT C A,A C T T C A A T A T C A C C AT AC TC C T C A T A A A C,C T C A C Majority 

3410 3420 3430 3440 3450 

tl I * G A A A A T A C T T T T G A A C T T C A A T A T G' A C £ A T A C T C C T G ATA A A C C T G A C 2603 ail.seq 
tl 112 A A A A T A C T T T A AGt T: GA A TAf G A CCA T A CTC C T GAT A A A fi'C'TG AC :i8ns21 aiHeq - 
2 I 19 A A T A GTTTT G A A CjT TC A-A-T;A--T„ G A C C- A T AC T C.C T GAT AAA- 6 C T A C cohl: aTUseq 
99 T TG'A A AATA.CT-T T..T CA A G T T C A r-A t A t G: A C CAT. A CTCC T G A.'*.T-. A A A ft C T n A c Jiiitlr 
01 TTG A.AA A T AC T T 
OO TTG A A A A T A C T T 



* . V*j 1 ^ A A * * A ^ A c i;. A 1 A C T C C T G A T A A A G C T G A C cohlLail^seq ' 
r at.T CA A G T T : C A- A- T A T. G: A C, C.A T. . A CSCC T G;A;.T A A A G G T G A C c^lO^l^eq 
r T T G A A Q T> T C A A; T A T G A G'C AT ACT CG T G A t A A A G C T-iGA.C he^i^liil seb 

CI T T G* A A C T-,T;C A A ;T ;AT- GiA C- A T^ 'T C 1 C T G A T A A< A G^C ftc" A^a909ik'i^^ V : ' « 



• A AT CCA A A-A G'C A TC T A 'A T C- C T': C C A A G A' A A AC CA G A AG T T C. A T A C T]C G. T G G MaioritV 

3460 3470 3480 3490 3500 

48 A A.T C C A A A A CC ATCTA AT CCT C C A'AG A A A A C C A G A A G -T T C A T A C T- G CTC G 2603 ail seq 
17 A A t C C A A A A C C A T C T.A A T C C TC. C A A G A. A AA C C A G A A G T T C A T A C t G G T G G 18rs2I_allseq 

49 AATCCAAAACCATCTAATCC T C C A AG A A A AC C A G A -AG T t C ATA C T G G T G G cohllalt seq 

49 AATCCAAAA C C A T C T A A TC C T C CA A C A AAACCAG AAGTTC ATACTGGTGG cjbill^allseq 
AA I GCAAAACCATCTA ATCCTCCAAGA.A'AACCA.GA ACTTCATACTGGTGC nem316 ail'seq 

50 AATCCAAAACCAT CTAATC'CTCCAAGAAAACCAG A AG T T C A T AC T G GT G G a909 all seq 

G A A A C G Ajjg[| jxS HA&ltj jflj AAACACAAACACTAGGTCGTC Matorltv 

3510 3520 3530 3540 3550 

J498 C AA AC C ATTTC T A A A C A A A C A CTC A A C AC A A AC'AC A A AC ACT A C GTCCT C 2603_ail seq 

CAAACGATTTCTAAACAAACACTCA AC AG AAACACAAACACTAGGTCGTC 18rs21 alTiea 

1499 GAAACCATTTGTAAAGAAAGACTCAACAGAAACACAAACACTAGGTGGTG cohl.ail seq 
U99 GAAACGATTTGTAAAGAAAGACTCAACAGAA AC ACAAACACTAGGTGGT G cjbill ailseq 
JSOl GAAACGATTTGT. AAAGAAAGA.CTCAACAGAAACACAAACACTAGGTGGTG ne^ielail seq 

1500 GAAACCATTTGTAAAGAAAGACTCAACAGAAACACAAACACTAGGTGGTG a909_ail.s^q 

C T>G ACTTTCATTTGTTGGC TTCTGATGGCACAGCACTAAAATGGACAGAT Majority 
3560 3570 3580 3590 3600 

C T.G A C T T T G A T T T G T T G G CT T C T G A T G.G G AC ACC ACT A A A ATGC A C AG AT 2603 ail. seq 
1317 CTGAGTTTGATTTGTTGGCTTCTGATCGGACAGCACTAAAATGGA'CAGAT 18rs21 ail seq 

1549 CTGAGTTTGATTTGTT G-G CTTCTGATGGGAC AG CAGTAAAATGGACAGAT cohl.ail seq 
S49. CTGAGTTTGATTTCTTCGCTTCTGATGGGACAGCAGTAAAATGGACAGAT cjbill ailseq 
1551 CTGACTTTCATTTCTTCCCTTCTCATGGGACAGCAGTAAAATGG ACACAT nemSie^ail seq 

1550 CTGACTTTGATTTGTTG.G CTTCTGATGGGAC AG CACTAAAATG G. A CAGAT a909_a7l.seq 

GCTCTTATTAAA CCGAATACTAATAAAAACTATAT T G C TCCAGAAGCTCT Major i ty 
3610 3620 3630 3640 3650 

598 GCTCTTATTAAACCGAATACTAATAAAAACTATATTGCTGGAGAAGCTGT 2603_.ail.seq 
367 GCTCTTATTAAACCCAATACTAATAAAAACTATATTCCTGCAGAACCTGT 18rs2I all seq 

599 GCTCTTATTAAACCCAATACTAATAAAAACTATATTCCTGGAGAACCTGT cohl.ail seq 
599 GCTCTTATTAAACCGAATACTAATAAAAACTATATTGCTGGAGAAGOTCT cjbUl ailseq 
60I GCTCTTATTAAACCGA AT ACT A A TAAAA AC TATA TTCC TCCAGAAGCTCT ne_i3t6~all seq 
6G0 G C'T CTTATTAAAGCGAATACTA AT AAAAACTATATTGCTGGAGAAGCTGT a909_ail_seq 

T A.C T'C C C C A A C C A A T C A A A T T.G A A A T C A'C A T A C A G A C C C T A CG TTTGAGA Major i ty 

3660 3670 3680 3690 3700 

648 TACTCC CC A A C C A A T C A A A T T C A A A T C A G A T A C A C A C G G T A C G T.T TGAGA 2603_ail seq . 
417 T A C T G G G C A A C C A A T C A A A T TG A A A TX A C A T AC AGACGGTACGTT TGAGA i8rs21_aU seq 

649 T A C T G G G C A A C C A A T C A A A t T G A A A T.G A C A T A C A G A C G G T A C G T T T G A G A cohl.ail. seq 
649 T AG T G G G C A A C C A A T C A A A T T G A A A T C A C A T A C A G A C 0 G T A C G TT T G A G A cjblll__aii seq 
651 TACTGGGCAAC. CAATCAAATTGAAATCACATACAG.ACCGTACGTTTGA G A nem3l6_ail seq 

650 TACTGCGCAACCAATCAAATTGAAATCACATACAGACCGTACCTTTGACA a909_ail.seq 



TTAAAGGTTTGGCTTATG CAGTTGATCCGAATCCAGAGGCTACAGCAGTA Majority 

3710 3720 3730 3740 3750 




698 TTAAAGGTTTGGCTTATGCAGTTGATG'CCAATGC'AGAGGGTACAGCAGTA 2603_all seq 

467 TTAAA.GGTTTGGCTTATGCAGTTGAT.GCG.AATGCAGAGGGTACAGCAGTA 18rs21_aiKseq 

S99 TTAAAGGTTTGCCTTATGCAGTTCATCCGAATGCACACCGTACAGCAGTA cohl.ail.seq 

TTAAACCTTTCGCTTATCGACTTGATCCGAATCCACAGCGTACAGCAGTA cjbUi.aU.seq 

.TTAAAGGTTTGGCTTATGCAGTTGATGCGAATGCAGACGGTACAGCAGTA nem3l6_ail seq 



70O TTA'A A G GTTTG'G CTT ATG C AGTT G ATGCG A ATG C AG A GG GT AC A G CA Gf A a909_aH.seq 

ACTTACAAATTA AA AG A A A C. A A A A GCACCA GAAGGTT ATQTAA.TC CC TG A Maj or i ty 

3760 3770 3780 3790 • 3800 

748 ACT TAC.A A ATT A A A AG A A AC A A A A G C A.6 C AG A AG GT T AT GT A AT C CC T G\A 2603 ail. seq 
517 A C T T A .q A A AT T A A A A G A A A G A A A A G C A C C A G A A G G T T A T G T A' A T G C C T G A I8rs5l__ail ,seq 
T49 A CT T A C A A AT J* A A A: A g;A-A A C A AA A G C A G.G A G A AG C'T T A TGT A A't. C C C T G A : cohUall.seq 

l A A T T A A-.A .A G A A. A C A A A A G C AG ,C A G A A.G GjT A T G-T A A T C C CTC A" cjbllljjaH seq 
A A T T : A A A A G A A,A C A A A A G C A £ C A G A A G G'TT A.T G TAi ^-Giil^AiiiS^^kfi. 
i A A T T A:iA^A:.A-G>A A ; A C A A' A A G C A C C A G A;A G G T T' A T G T AV^TlG-fCi^C 'T>G : A':a§Q9j_^3Kse^"" *'"' 

T A A.AG/A A A TC C A: G. TT T- AC A G T A.T C A. G A A AC A'T C T T A T A -A T A C i-kVA- AC C A A M^brityr 

3810 3820 3830 . 3840 3850 

m *T A.A AGAAATCGA G T.T T A O A G T AT C A C A A A C A.T C TT AT A AT AC A A A AC CA A 2G03_ail.seq 
567 T A A A G A A A T C G AX T T T A C A. G T A T C A C A'A A C A T. C 'T T A T A A T A C A A A A C C A A lSrs^aH .seq 
r99 T AA A G A AATCC A GT TT A C A GtATC AC A A A C A T C T.T A T A A T A C A A A A C C A A cohl_*iKseq 
r99 *T A: A AG A AATCGAGTTTACAGTATC AC AA A C A T C T T.A T A A T A C A A A A CCA A cjblHjall^seq 
101 TAAAGAA ATCCA C T TTAC A.QTATC ACAAACATCTTAT AAT.ACA AA ACC AA nen.316 ail,sS 
JOO T A:A A G A A ATC G.A GTTT AC A G TAT C A C A A A C A T CTT A T A A T AC A A AACCAA a909_JaU.seq : 



T49 A CTT AC A 

rsi a c;t T A C A 

T50 A C T T A X A 

~ T G A c A Tit^fbT-c-f *tfvo&vi ear nm r ax; 



3 ^ 0 3870 3890 



38"8 CTGACATC ACCC T T G ATAGTGCTGATcrAAfAr-^ 

3617 CTCACAT.CACCCTTCATAG^C^^Gcl T " 1 i 2603 aU.seq 

3920 3930 J 

3«99 AACAAAC.GTCCTTCAATCCCTAA^CTCC?cr?^>^^ C(:(;CIATCTT 18r S 2^ail^eq 

3S00 -c^cctcctt^^c*.^^ 



j • — « o9ou 3990 4nnn 




TGTCGCTAT C.C G T OCT GCGCTCATGGCTTTTrr I'S ttii *"''' <JMllj.il.seq 

■»CCnCA.AU t iTIACTUniAU Ct CT^TTCTTA^n ? tfaJority 

4010 4020 4030 4040 4050 






" — ■ . , i — - i - -*w*u 4O50 

\ I ^^^^ -oa_ aI ,. seq 

GTCCTACAAA.AGAT'AACTAAATAAAAGCCTArT'T^TT^^ I8rs21_aU .seq 

0 T T §G T ^ A A A ^G G A A T T A A A A c C TAA -A A 1 i Hs!?H?? I? ?H 1 « " « " 

CICCIACAA4AGATAACTAAAIAAAAC G'C T A CTT CTT4 AGIAACCATGTT agO^'aiu'seq^ 

ta.AAAA^AAATAGCCfT.TtTOTCTtTTtrTCGTTT . 

— 4 °- 6 ° 40 . 70 4080 7^ 

^ * * Aa'a'aGAC.a'-a'atACCC^ 2 603_ alt .^ 
THCAAAACACAAA'TAGCCTT'ArTI C T C T T T T T r t r ? J i '8rs21_aIKseq 
TAACAAAAGA.GAAATACCCITA^JcJc^^^Jr^^r'"* 10 *^^^ 
TAAG AAAA GAGAAATAGCCTTATTTCTCTTTTTrTrrTTT cjblll_ail!seq 
^ A AC A A A AGAGAAATAGCCT. TAT^T^. T^T^T^T^T^C^T^T^AA^A^AAA aS^'aTi 'seq 
.CGAACATC A TG A AA C A < A G ft T T t A A A C T T < T G T TT.I C T T T T I 1 t c t t r it ■■ ...,,„, 
i ■ 4 '.'° I™ . 41 30 4140- ^0 

099 6 B * A C-A fC A/T C AA AC A a£H t || T t T * T VIVZl ll^fi ™* ISfS^allSeq 

099 C G A A C A T C A T C A A A C A A A C A t * A A^l f-f "5 T'"» "T Z I ZH $ T ^ T V? P ; T G « O A * G f^jM,^ 
m COM; A c.ATeATG^AAC A iHiM*;.M 
- G G.AA G.A T C A T G. A-A AC £ A- 'A- C A* it: 

. • * f ffi T- 1 T C C-A ATT 4^^ ^ - 
4160 * • 41=76 . - - - -■. 7^ \ • "• . 

i — - i ■ . 41 80 4190 4200 

S Ttlo GC tg TA ^ T ^^ A ^^^X A 

149 T T A C G C A ■ C T A T C T T T G G A A T T A G C C A A A C T 6 I 1 1 1 A C ^ A A ^ * * A C T C A 18re 2 I_alHeq 
151 TTAGCGACIA T"C T°T TCCA^-TTAG^C^^i^^ t3 I T T I£ AA^©TC A <^MU.^ailtseq 

150 TT J C G G A CTATCTTTG GA ATT ACCCA AACTCTTT^ AG C G C A AG A A AC T .C A "aSO^aTl'setj 



^^LACtt^I^^ 

, i *™ ^ "23° 4240 So 

4 199 TCACTICAC TTCnc I ATTCATCCTCCAAATC ISr^ait^eq 

4199 TCAGTTGACCATTCTTCATCTTGAAGCAArrrAT^T'''^^ ^"^^CAAATC cohl_ail.seq 

4300 tcacttcaccattcttcatc^c^ac^ag 0 ^^^^^^ 

CA.CAGTTGCAGATTGCCCCT AAAGA AGG G ACTCCAATTCAACGACTArTr Major.* 
^ 4 * 7 ° . 4280 4290 

^ CACAGTTCCA C A TTCCCeCTAAAGAAGGrArTrr AA ^^^^ A ^^ A ^^ A ^^*' 2 **03_all'.se<j 

4249 CACAGTTGCACATTCCCr cl AAAGAAe G CACTCC AATTCAACCAGTACTC ISr^alHeq 

4249 CACACT UCCC « A.rrr A £ ™ CA A T T C A A G G A G T A C T C co„l_aIl.s«P 

€251 CACACT « ATICCCCcJ ^ r^*^^ ATTC4ACGACTACTC cjbm^lHeq 

«250 CACAGTTGGACATTGCCCCTAAAGAACCGACTCCAAT C A ACGACTACTC a90SL.a7t!seq^ 

TA.TCACTTGTACCAATTAAAATCAACTGAAGATCr.nr. A TTTCTTGr.nAn, „^ orIty 
/ ,3 . , ° 4320 4330 4340 . . 4^0 " 



X TATC AC IT CTACCAATTAAAATCAAC T r A A r A T ** C C C A T T T C T T G C C A C A 2603_^all.seq 

1299 TA'TCACTTGTACCAATTAAAATCAACT^'A ISrsZi^U^eq 

1299 T.ATCACTICTACCAATT^!«Tr AA JI5! A J A CCC<:ArTTCTT(:CCACA cohl.all.seq 

1301 TATctc?Ic?ACCAX^AAl^CAAC^G^ 

.300 TATCACIICIACCAAITAAAATCAACTCA A4AI fi'c C A I.TT CT T C C C A C A a^9^.a7l Iseq^ 

TTCCAATfCCCTAACTATCACAGAATTGAAAAAACAr. r. CGCAGCAr.r.TT-r »^,^. 
4380 4390 ~ 

5S ^C^TT^C^A^^ 

135 1 TTG^A^T^C^C^AAC^A^C^CAGAA TT G A A A A A A r> t S S S S ° * P C X GG T T T cJbUI.aiHeq 
*50 T It G A AT TCC'CTA/VCTA. TC AC A G.A ATT G.A A A A A A C A ff 6 C G C AC'C A G G T T T Sjll.'s^ 9 ' 

• TTGAAGCCACTACT'AA-TGAACAAGGAAACGCTACAT TT.AACCAArTArrA Majority 
— . ^ ffO 4430 4440 7^0 

5 A A cTtllflTlic Ja * * T T T A * C 15 A A C T A C d A Wrsi^^q 

£ TTGAACCCACTACTAAT^ ^UljlitSeq 
- TTGAACCCACTACTAATCAACAA G'G A A AG 6 C T A C.A TTTAACCAACTACCA a90^.a7l!seq 
GATGGAATTTATTATCGTCTCCC.CCTTA A A G C C C C T n. A A A A A A A T n C. T A A mjorlty 
4460 . . .. . 4480 4490 t~ 







T G T C T.C A C C tt t,C T T G G T T G Ac T' t C'T'ciT- G.iA/cVo^A. ft' aI 'A&a-. c "^^G;A. jT-T'T-A T C n j^^rity '' 

-. 4? . 10 4S . 2 °. .. : ; 4539 :;, • .. 4540 ^ • 

T G Tt T C ACCTTTCTTCCf T C A r 1 1 r I tt ^ A * A A A GTGATTTATCC T A 2603 aU. S eq 
^cTcTg.C^TT^tIc^tTc^^^ 

ItIC,TCACCniCITCC.TTCACITCTC cu n ' « "ro T C d't A ^ 7? ^ 
TG-TCTC AGCTTTCTTGC IT CACTTCT'CTGAGGA'TAAAGTCATTTATrPTA J < ^ 

T'CTC TCAGCTTTCTTGGTTGA'CTTG-TCTGA C t A T A A A G T d A T T T A T C C T A aOGS.^aTKseq 6 ''' 

A A A T. C A 1 "Cfccl c'ii Rg(£i ggg^ 14 TTGCT TAAACTTCCTCTCCAT M aToTTT^— = \ 

4560 4570 4580 4590 4600

m7 A ^ A I^T^I^^^ A ^ AGCTGA GTTGGACTTGCTTAAAGTTGGTGTCGAT 2603__aU.seq 
TaI AAA I GA I" GGTCCACAGGTG AGTTGCACTTGCTTAAAGTTGGTGTGGAT 18rs21 alHeq 
\lf 0 AA ATCATCTGCTCCACACCTCACTTGGACTTCCTTAAAGTT.GGTCTCCATcohIaTl. seT* 

*551 AAATCATCTGGTCCACAGGTG A GTTGGACTTGCTTAAAGTTCGTGTGGAT nem3l6~ail sea 
1550 AAATCATCTGCTCCACACGTCACTTCGACTTGCTTAAACTTCGTCTCCAT a^alt!^ 

GCTGATACC AAAAAA CCACTAGCAGCCGTTGTCTTTGAACTTTATCAAAA Majority 
4610 4620 4630 4640 4650

r r "*r f* A *r a GGAAA AAACCACTAGCAGG'CGTTGTCTTTGAACTTTATGA AAA 18rs21_ail!seq 
tell GG ^ GA TACCAAAAAACCACTAGCAGGCCTTGTCTTTGAACTTTATGAAAA cohl_ail.seq 
^? GG 1 GA J A ^ GAAA AAACCACTAGCAGGCGTTGTCTTTGAACTTTATGAAAA cJbUl ail!seq 
^ GGTCATArrAA^A^AA iH ^ACTAGCA'GGCCTTGTCTTTGAACTTTATGA.AAA nitflolail .se^ 
1600 GGTGATACCAAAAAABcACTAGCAGGCCTTGTCTTTGAACTTTATCAAAA a909__aiKseq 

G A A T G G T A G G ACTCC T A T T CG T C TC A A A A AT GG C C T C CATTCTCAAG ATA Maj or i ty 

4660 4670 4680 4690 4700

GAATGGTAGCACTCCTA TTCGTCTGAAAAAtGGGGTGCATTCTCAAGA T A 2603 all.seq 

M17 CAATGGTACGACTCCTATTCGTGtGAAAAATGGGGTGCATTCTCAAGATA 18rs21 ail sea 

5^- A A ^ GGTAGG ACTCCTATTCGTGTGAAAAATGGGGTGCATTCTCAAGATA cohl aTK seq 

^ A A J G G T A G G A C.T C-C T A T T CG T G T G A A A A A T C G G G T G C A T t. C T C A A G A T A cjblli all^eq 

51 M TACGACTCCUTTCGKTCA A A A ATCGGGT.GC ATTCTC A AG ATA ne^l6~ai l.seq 
K550 GAATGGTAGGACTCCTATTCGTGTGAAAAATGGGGTGCATT.CTCAAGATA a909_aTl.seq 

TTG ACGCTGCA AAA C AT TT A C A A A C A G AT TC AT C A C C C CA T A TC A C A A T T Majority 
4710 4720 4730 4740 4750

J*?* HGACCCTGCAAAACATTTAGAAACAGATTCAT C A GGGCATATCAGAATT 2603 ait. seq 

52 ^ GACGCTGCAAA ACATTTACAAACAGAT T C A T C A G C G C A T A T C A G A A T T 18rs21 aillseq 
^ G A G G G T G C A A A A C A T T T A G A A A C A G A T T C A T G A G G G C A T A T C A G A A T T cobl all.seq 

5^? T T G A C G C T G C A A.A A C A T T T AG A AACAGATTCATCAGGGCATATCAGAATT cjblli ailTseq 
™ I J £ a £ S S ^ . S A A A A C A T T T A G A A ACAGA TTCATCAGGC.CATATCAGAATT ne*316~ail .sea* 
1700 TTGACCGT G C A A A A. C A T T T A G A A A C A G A T T C AT C> GGGCATATCAGAATT a909_ail,seq 

TCCCGCCT CAT CCATGGCG ACTATCTCTTAAAAGAAATCGAGACACAGTC Majority 
4760 4770 4780 4790 4800

ZHZ AT CC ATGC CCACT ATCTCTTAA AAC AA ATCCACAC A CACTC 2603 ail.seq 

551 3 G ^ GGGCTCATCCATGGGGAeTA TGTCTTAAAAGAAATCGAG.ACACAGTC 18rs21 aiKseq 

52 3 GGGGGG TCATCCATGGGGACTATGTCTTAAAAGAAATCGAGACACAGTCcohl all.seq 

25? I G CCGGCTCATCCATGGGCACTATGTCTTAAAAGAAATCGAGACACAGTC cjblli ail^eq 

™ I GCCGGCTGATCCA TGGGGACTATGTCTtAAAAGAAATCGAGACACAGTC ne*3nfail.seq 

1750 T C C G G G C T C A T C C A T G C C G A C T A T G T C T T A A A A G A A A T C C A G A C A C A G T C a909.la7l.seq 

A CC A T A T C A C A T C G C A C AC C C A C AG ACTCCTCTC ACT. AT TGAAAAATC A A Majority- 

4810 4820 4830 4840 4850

52 A GC A T A T C kG ATCGGAG A G : * e A G A G A C T G CT'GTG ACT A TT G A A A A A TC A A 2603 all.seq 






A G C A T A T G A G A T C G*G' A C A.-G G <5 A G A G A C T C C T. G T G A G T A T. T G A A A A A T C A A 18r^21,aH .seq 




AAACAGTAACAGTAACGATTGAAAATAAAAAAGTTCCGACACCTAAAGTG Majority
4860 4870 4880 4890 4900

4848 AAAGAGTAACAGTAACGATTGAAAATAAAAAAGTTCCGACACCTAAAGTG 2603_ail.seq
4617 AAACAGTAACAGTAACGATTGAAAATAAAAAAGTTCCGACACCTAAAGTG 18rs21_ail.seq
4849 AAACAGTAACAGTAACGATTGAAAATAAAAAAGTTCCGACACCTAAAGTG coh1_ail.seq
4849 AAACAGTAACAGTAACGATTGAAAATAAAAAAGTTCCGACACCTAAAGTG cjb111_ail.seq
4851 AAACAGTAACAGTAACGATTGAAAATAAAAAAGTTCCCACACCTAAAGTG nem316_ail.seq
4850 AAACAGTAACAGTAACGATTGAAAATAAAAAAGTTCCGACACCTAAACTG a909_ail.seq
CAGGTCAGCAACACCCAATCGC Majority
4910 4920 4930 4940 4950

4898 CCATCTCGACGACGTCTTATTCCCAAAACAGGTGAGCAACAGGCAATGGC 2603_ail.seq
4667 CCATCTCGAGCAGGTCTTATTCCCAAAACAGGTCAGCAACAGGCAATGGC 18rs21_ail.seq
4899 CCATCTCGAGGAGGTCTTATTCCCAAAACAGGTGAGCAACAGGCAATGGC coh1_ail.seq
4899 CCATCTCGAGGAGGTCTTATTCCCAAAACAGGTGAGCAACAGGCAATGGC cjb111_ail.seq
4901 CCATCTCGACGAGCTCTTATTCCCAAAACAGGTGAGCAACAGGCAATGGC nem316_ail.seq
4900 CCATCTCGACGAGGTCTTATTCCCAAAACAGGTGAGCAACAGGCAATGGC a909_ail.seq

ACTTCTAATTATTCCTGCTATTTTAATTCCTTTAGCCTTACGATTACTAT Majority
4960 4970 4980 4990 5000

4948 ACTTGTAATTATTGGTGGTATTTTAATTGCTTTAGCCTTACGATTACTAT 2603_ail.seq
4717 ACTTGTAATTATTGGTGGTATTTTAATTGCTTTACCCTTACCATTACTAT 18rs21_ail.seq
4949 ACTTGTAATTATTCGTGGTATTTTAATTGCTTTAGCCTTACGATTACTAT coh1_ail.seq
4949 ACTTGTAATTATTGGTGGTATTTTAATTGCTTTAGCCTTACGATTACTAT cjb111_ail.seq
4951 ACTTGTAATTATTGGTGCTATTTTAATTGCTTTACCCTTACCATTACTAT nem316_ail.seq
4950 ACTTGTAATTATTGGTGGTATTTTAATTGCTTTAGCCTTACCATTACTAT a909_ail.seq

CAAAACATCCGAAACATCAAAATAACGATTAGCATGGCACAAAAATCAAA Majority 
5010 5020 5030 5040 5050 



4998 CAAAACATCGGAAACATCAAAATAAGGATTAGCATGGGACAAAAATCAAA 2603_ail.seq
4767 CAAAACATCGGAAACATCAAAATAACCATTACCATCCGACAAAAATCAAA 18rs21_ail.seq
4999 CAAAACATCGGAAACATCAAAATAAGGATTACCATCCGACAAAAATCAAA coh1_ail.seq
4999 CAAAACATCGCAAACATCAAAATAAGGATTACCATGGGACAAAAATCAAA cjb111_ail.seq
5001 CAAAACATCGGAAACATCAAAATAAGCATTAGCATCCGACAAAAATCAAA nem316_ail.seq
5000 CAAAACATCCGAAACATCAAAATAAGGATTAGCATGGGACAAAAATCAAA a909_ail.seq

AATATCTCTAGCTACCAATATTCCTATATGCATTTTTCGTTTAATTTTCT Majority
5060 5070 5080 5090 5100

5048 AATATCTCTACCTACGAATATTCCTATATGGATTTTTCGTTTAATTTTCT 2603_ail.seq
4817 AATATCTCTAGCTACCAATATTCGTATATCGATTTTTCGTTTAATTTTCT 18rs21_ail.seq
5049 AATATCTCTACCTACGAATATTCGTATATGGATTTTTCGTTTAATTTTCT coh1_ail.seq
5049 AATATCTCTAGCTACGAATATTCGTATATGCATTTTTCGTTTAATTTTCT cjb111_ail.seq
5051 AATATCTCTAGCTACGAATATTCGTATATGGATTTTTCGTTTAATTTTCT nem316_ail.seq
5050 AATATCTCTAGCTACGAATATTCGTATATGGATTTTTCGTTTAATTTTCT a909_ail.seq

TAGCCCGTTTCCTTGTTTTGCCATTTCCCATCGTTAGTCACCTCATGTAC Majority
5110 5120 5130 5140 5150

5098 TAGCGGGTTTCCTTGTTTTGCCATTTCCCATCGTTAGTCAGGTCATGTAC 2603_ail.seq
4867 TAGCGGGTTTCCTTGTTTTGGCATTTCCCATCGTTAGTCAGGTCATGTAC 18rs21_ail.seq
5099 TAGCGCCTTTCCTTGTTTTGGCATTTCCCATCGTTAGTCAGGTCATGTAC coh1_ail.seq
5099 TAGCGGGTTTCCTTGTTTTGGCATTTCCCATCGTTAGTCACGTCATGTAC cjb111_ail.seq
5101 TAGCGGGTTTCCTTGTTTTGGCATTTCCCATCGTTACTCACGTCATGTAC nem316_ail.seq
5100 TAGCGGGTTTCCTTGTTTTGGCATTTCCCATCGTTAGTCAGGTCATGTAC a909_ail.seq

T T T C A A G CC TC T-CA C C C C A AT ATT A A T'C C TT TT A A'-A C A A.C C T CTT.A C C A A Majority 

5160 5170 5180 5190 5200

ii48 T.TT C A A.G\C CTCT CA C G -Q C A A T A T T A A T G C T T T T A A A G A A G C T GTtACC A A 2603_all seq 
917 T TT C A AG CC :T; C T C A C G C X A AT A T T- A ATCCTT T T A A A G A A GX T G T T A C C A 5 A 18rs2X. ail^sW 
;i49 TTTCAAGCCT C.T C AC CC C A A T AT TAATGCTTT T AAA & A C t GTT A C C-A A o6hii^Jl seq 

149 T T .T. C A A G C C T C. T CjA C,GX C A A -T A T T A A T G C"'*E T T T* A' A "A. G 11^ A G jG T ^G - THf 'A C X A. A Gj'billl.ai i v seq 
151 TT^rCAAGC C 'T'C! 'T.C.A C GC CA A T A T T V A A T GCTTT T A A A : G A^Ai&'CtT<G;T^T^X -C ' A*-Ar ii^ZYG^ali sed 

150 T TT C A AG CG T;C' T G 'Jfcfc GC X{A AT ATT AA T G C T T T'T G; A J^^V^X^.^ C ; ii^jsS^%l^ 

GATTGACCCGCTGGAGATTAATCGCCGTTTAGAACTTGCTTATGCTTATA Majority
5210 5220 5230 5240 5250

5198 GATTGACCCGCTGGAGATTAATCGGCGTTTAGAACTTCCTTATGCTTATA 2603_ail.seq
4967 GATTGACCGGGTGGAGATTAATCGGCGTTTAGAACTTGCTTATGCTTATA 18rs21_ail.seq
5199 GATTGACCGGGTGGAGATTAATCCGCCTTTAGAACTTGCTTATGCTTATA coh1_ail.seq
5199 GATTGACCCGGTGGAGATTAATCGGCGTTTAGAACTTGCTTATGCTTATA cjb111_ail.seq
5201 GATTGACCGGGTGGACATTAATCGGCGTTTAGAACTTGCTTATGCTTATA nem316_ail.seq
5200 CATTGACCGGGTGGAGATTAATCGGCGTTTAGAACTTGCTTATGCTTATA a909_ail.seq
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a c c c c a »f ^•r g --cy < OTfe;cz a ? c c « a ; T ; T c e a g c | c t t a a r 

5260 5270 5280 5290 5300

£>II V.^S! ATJUCACGTCCCiAAACTAATCGCCA *TATCCnCCCTTAAi 18rs21 ail sea 

5249 ACC.CCACTATAGCAGCTGCCAAAACTAATGGCGAATATCCACCCCTTAAA ™hi Ti 

5249 ACGCCAGTATAGCAGGTGCCAAAACTAATGCCC'AATATCCAGCGCTTAAA _J_m lifL, 
525! ACGCCAGTATAGCAGGTGCCAAAACTAATGGCGAATA T__ CACCGCTTAAA _!__!_~_ 

5250 ACCCCACTATAGCAGGTGCCAAAACTAATGGCCAATA T C CACCGCTTAAA ^U._^ 

. CACCCCTACTCTCCTCAACAA AACCAGCCAGCGCTCCTTCACTACCCCCC Majority 

5310 5320 5330 5340 5350

SS! GACCCCTACTCTCCTGAACAAAAGCACCCAGGGGTCGTTGACTACGCCCC 2603 ail sea 

^ AX C^C^cJcc^^r^^r AGGCAGGGGTCCTTGAGTACGCCCC S^it^eq 
^9 GACCCCTACTCTGCTCAACAAAACCAGCCAGGGCTC G T TGAGTACGrrrr mhi =,Ti 

^? rArrrrT^''^*''''*''^^**^' * * AAGCAGGCAGGGGTCGTTGAC T A CCCC C C cJblTT ai^seq 
S301 GACCCCTACTCTGCTCA AC AAAAGCAGGCAGGGGTCCTTCACTACGCCCC n«^lfi~ »il 
5300 *"ACCCCTACTCTGCTGAACAAAAGCAGGCAGGGGTCGTTGAGTACGdCCG a^altseT 

CATGCtTCAACTCAAACAAC A AATACGTCATCTGATTATTCCAAGAATTA Majority 

5360 5370 5380 5390 5400

jf* 8 CATGCTTCAAGTCAAAGAACAAATAGGTCA T GTG A T TATTCCAAGAATTA 2603 all sea 
Zll * TGCTTGAAGTCA A AG A ACAAATAGGTCATGTGAT TATTCCAAGAATTA 18rs21 ail^seq 
.349 CATCCTTGAACTCAAAGAACAAATAGGTCATCTGATTATTCCAAGAATTA "hi ^7? ^ 
049 CATCCTTGAAGTCAAAGAACAAATAGGTCATCTGATTATTCCAAGAATTA U aiHea 
™ « CT T C A A G T C A A A C A A C A A * T A G G T C A T G T G A T T A TT C C A A G A A T T A "s2 

.350 CATGCTTCAACTCAAAGAACAAATAGGTCATGTGATTATTCCAAGAATTA SX 

ATCAGGATATCCCTATTTAC GCTCCCTCT.GCTGAACAAAA.TCTT. CAGAGG Majority 
5410 5420 5430 5440 5450

?„® A n A ^^ ATA ^ tGCTATTTA ^ GCTGGCTCT « CTCAACAA AATCTTCAGAGC 2603 ail seo 
411 A J G ^ G ? G ^ A Z A ^ G ; GGTATT ^ AGGG ^ GGGT ''TGCTGAAGAAAATCTT.CAGAGG I8rs21 ai i?_eq 
^ot A I GAGCATATCCCTATTTACGCTGG CTCTGCTGAAGAAAATCTTCACAGC<i>hlaTl seq 

399 A T C A G G A T A T C C C T A T T T A C G C T G G C T C T G C T G A A' G A A A A T C T T C A G A G G SblTl aifsea 
A J G GA A ^ C T A T T T A C G C T G G C T C T G C T G A A C A A A A ^" G '*^ T C A G A G G nem316 _ ail ]s^[ 

400 ATCAC.GATATCCCTATTTACGCTGGCTCTGCTGAACAAAATCTTCAGXGG a909_a7l .seq ' 

CGCC TTGCACATTTACACCC CA CCACTCTTCCAGTCGGTC CTGACTCAAC Majority 

5460 5470 5480 5490 5500

«» GCCCTTGCACATTTAGACCCC AC C A G T C T T C C A G T C G G T G G T G A G T C A A C 2603 ail.seq 

11 CGCG II GGACATTTAGA CGGGACCAGTCTTCCAGTCGCTCGTGAGTCAAC ISrsil ail!_eq 

449 GGCGT.TGGACATT.TAGAGGGGACCAGTCTTCCAGTCCGTGGTGAG'TCAAC cohi all sea 
4"5^ GGGG ^^ GG ACA.TTTAGA. GGGGACCAGTCTTCCAGTCGGTGGTG A.G T-C A A C ^jbljT aiT^eq 
i51 " GGGG J^ GGAG ATTTAGAGGGGA'CCAG'TCTTCCAGTCGGTGGTGAGTCAAC ne.016~ait._e2 

450 GGCC. TTGGACATTTAGAGGGGACCA'GTCTTCCAG. TCGCTGGTGAGTCAAC aSOg.ali.se^ 

TCATCCCCTTCTAACTCCCCATCCA CM C T A C C A A C G C C C A A G C T A T T T A Majority 

5510 5520 5530 5540 5550

«* I G A I G C C G T T C T A A C T G C C C A TC G A; G G G G T A C O A A C G C G C A A G C T ATT ft 2603 ail.seq 

T C T * •* P ~ G C C C *~ G G A G G G C ■* A G C:A A C G.G C C A A G C.T ATT T A S Seq' 
£- t J< ^ -r ^ ^ ? ^ ^ ^ A A C T G.C C C A T C _ A'G'G C O T A C C A ^ C G. C 0 G' A A G-C T ATT T..A oohl aTilseq . 
J99 T C A T G C C G T T C T. A A CT8.CCC A TC GAG GCCXAjCCt AC G G C t k A G C f- AT T T A gImIi ai^eq 
^ t r i t ^ ^ ? ^ ^ ^ A' A G T G C &C'A\T:C. : G.'A{^ 4 '.^-'^t!.T A;<? C AiA'C G^ -C 1 GsA'A' GkG.T A : ^' T T. A TO^t6^__l.J_§q : ^ 
500: T C AT GC CO TT C T A A CTCC MA:«'C^ ; gStM^ A A C..G il C C A A G>C Tf A T T T A ipoaBTk^ 

C C A A T T T A C A C A A G G T A A C A C T A G C T G A C C G TT T T T A C AT T C A A C A C A T t Majority . , 

5560 5570 5580 5590 5600

«* C-C A A T T T A G A C A A G G T A A C ACTA G G T G A C C G T T tf T A G A T T C A A C A C A T C 2603- a il;seq 
■Il r 5-_ «^n A ^ A ^ A A G G T A A C A GT AGGTGACCGTTT.TTA C.ATTG A A C A C A TC 18rsil_ail^eq 
do CCAA ^T AGACAAGGTAACAG TAGGT.G ACCGTTTTXACATTGAAC AC AT C odhl ail.seq^ " 
^ r'f < i ttti J « AA 9 G ^ AAGA G ^ A G G T G A C'C G TT T T T A CA T T G AA G AC A T C cjblTl aiTtseq 
>S1 CCAATTTA.GACAAG G..T AACAGTAGGTGACCGTTTTTACATTCAACACATr „;„-,.«" 
SSO ''.^ T^Tl C AC A A'G £ TA A C ACT A T C AC'^GT TIJ IAC 1 f TC4 i C 4 C ATC a9TO_aiKseq 



GGCGGAAAGATTGCTTATCAGGTAGACCAAATCAAAGTTATCGCCCCTGA Majority
5610 5620 5630 5640 5650

5598 GGCGGAAAGATTGCTTATCAGGTAGACCAAATCAAAGTTATCGCCCCTCA 2603_ail.seq
5367 GGCGGAAAGATTGCTTATCAGGTAGACCAAATCAAAGTTATCGCCCCTGA 18rs21_ail.seq
5599 GCCGGAAAGATTGCTTATCAGGTAGACCAAATCAAAGTTATCGCCCCTGA coh1_ail.seq
5599 GGCGGAAAGATTGCTTATCAGGTAGACCAAATCAAAGTTATCGCCCCTGA cjb111_ail.seq
5601 GGCGGAAAGATTGCTTATCAGGTAGACCAAATCAAAGTTATCGCCCCTGA nem316_ail.seq
5600 GGCGGAAAGATTGCTTATCAGGTAGACCAAATCAAAGTTATCGCCCCTGA a909_ail.seq



TCAGTTAGAGGATTTCTACGTGATTCAAGGAGAACATCACGTCACCCTAT Majority
5660 5670 5680 5690 5700

5648 TCAGTTAGAGGATTTGTACCTGATTCAACGAGAACATCACGTCACCCTAT 2603_ail.seq
5417 TCACTTAGAGGATTTGTACGTGATTCAAGGAGAAGATCACGTCACCCTAT 18rs21_ail.seq
5649 TCAGTTAGAGGATTTGTACGTGATTCAAGGAGAAGATCACGTCACCCTAT coh1_ail.seq
5649 TCAGTTAGAGGATTTGTACGTGATTCAAGGAGAAGATCACGTCACCCTAT cjb111_ail.seq
5651 TCAGTTAGAGGATTTGTACGTGATTCAAGGAGAAGATCACGTCACCCTAT nem316_ail.seq
5650 TCACTTAGAGGATTTCTACGTGATTCAACGAGAAGATCACGTCACCCTAT a909_ail.seq

TAACTTGCACACCTTATATGATAAATAGTCATCGCCTCCTCGTTCGAGGC Majority
5710 5720 5730 5740 5750

5698 TAACTTGCACACCTTATATGATAAATAGTCATCGCGTCCTCGTTCGAGGC 2603_ail.seq
5467 TAACTTGCACACCTTATATGATAAATAGTCATCGCCTCCTCGTTCGAGGC 18rs21_ail.seq
5699 TAACTTGCACACCTTATATGATAAATAGTCATCGCCTCCTCGTTCGAGGC coh1_ail.seq
5699 TAACTTGCACACCTTATATGATAAATAGTCATCGCCTCCTCGTTCGAGGC cjb111_ail.seq
5701 TAACTTCCACACCTTATATCATAAATAGTCATCGCCTCCTCGTTCGACCC nem316_ail.seq
5700 TAACTTGCACACCTTATATGATAAATAGTCATCGCCTCCTCGTTCGAGGC a909_ail.seq



AAGCGAATTCCTTATGTGGAAAAAACAGTGCAGAAAGATTCAAAGACCTT Majority
5760 5770 5780 5790 5800

5748 AAGCGAATTCCTTATGTGGAAAAAACAGTGCAGAAAGATTCAAAGACCTT 2603_ail.seq
5517 AAGCGAATTGCTTATGTGGAAAAAACACTGCAGAAAGATTCAAAGACCTT 18rs21_ail.seq
5749 AAGCGAATTCCTTATGTGGAAAAAACAGTGCAGAAAGATTCAAAGACCTT coh1_ail.seq
5749 AAGCGAATTGCTTATGTGGAAAAAACAGTGCAGAAAGATTCAAAGACCTT cjb111_ail.seq
5751 AAGCGAATTCCTTATGTGGAAAAAACAGTGCAGAAAGATTCAAAGACCTT nem316_ail.seq
5750 AAGCGAATTCCTTATGTGGAAAAAACAGTGCAGAAAGATTCAAAGACCTT a909_ail.seq

CAGGCAACAACAATACCTAACCTATGCTATGTGGGTAGTCGTTGGACTTA Majority
5810 5820 5830 5840 5850

5798 CACGCAACAACAATACCTAACCTATGCTATGTGGGTAGTCGTTGGACTTA 2603_ail.seq
5567 CAGGCAACAACAATACCTAACCTATGCTATGTGGGTAGTCGTTGGACTTA 18rs21_ail.seq
5799 CAGGCAACAACAATACCTAACCTATGCTATCTGGGTAGTCGTTGGACTTA coh1_ail.seq
5799 CAGGCAACAACAATACCTAACCTATGCTATGTGGGTAGTCGTTGGACTTA cjb111_ail.seq
5801 CAGGCAACAACAATACCTAACCTATGCTATGTGGGTAGTCGTTGGACTTA nem316_ail.seq
5800 CAGGCAACAACAATACCTAACCTATGCTATGTGGGTAGTCGTTCGACTTA a909_ail.seq

TCTTGC TGTCGCTTC T C A T IT G G TT.f A A. A A AG A C G A A A C A G A A A A AG C GC Majority 



5860 5870 5880 5890 5900



T C; T T G C T G t:C.G C T t C t C A T ft G G T X T.A A A. A A GACGAAAC AG A A A A A G C G G 2603_.ail.seq 

T C T" T G C 'T-G- T £ Q C T ? C T CA T T T G "G T T T A A A A A G4 C. G AAA C A G? A AV^ A A G € G : & IBrsSl^iaU _seq 
T CT T G C-T C.T.CCt.CT T C T € ATT j G G t T t-A A A A A CA.C C A A A C, ArG .A A'A A AG ;C" G G ^hi_a>ltf JseqV 
TC T T T GO" C <^G T T 0TC A/^ 

T C T*T G:?\0.& C .T^^e^rC. t A..t.^VTvG'^:•T• Tfr.-T; 'kAvA/A ,A. G A, Ct-A ^'A- : C7A:--t-A^A^:A^''6^ uismim^iUs^i 'v.tf'-X , ; 

T CT T G C T G T C G C T T CT C A T^T.:G'- G^t T.'^A'A'A'-A A G A C G AA A C A^Va^A^A A-G C G G a909 ail ;seq : " 



AGAAAGAATGAAAAAGCGGCTAGTCAAAATAGTCACAATAATTCGAAATA Majority
5910 5920 5930 5940 5950

5898 AGAAACAATGAAAAAGCGGCTACTCAAAATAGTCACAATAATTCGAAATA 2603_ail.seq
5667 AGAAAGAATGAAAAAGCGGCTAGTCAAAATAGTCACAATAATTCGAAATA 18rs21_ail.seq
5899 AGAAAGAATGAAAAAGCGGCTAGTCAAAATAGTCACAATAATTCGAAATA coh1_ail.seq
5899 AGAAAGAATGAAAAAGCGGCTAGTCAAAATAGTCACAATAATTCGAAATA cjb111_ail.seq
5901 AGAAAGAATGAAAAAGCGGCTAGTCAAAATAGTCACAATAATTCGAAATA nem316_ail.seq
5900 AGAAAGAATGAAAAAGCGGCTAGTCAAAATAGTCACAATAATTCGAAATA a909_ail.seq
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A T A A" A A ^^^lagElS^iB B 5 CGAAGTCTGATT CTCTTATTT Itaioritv 

5960 5970 5980 5990 6000

^7 ^AAAATCAGAACCCTCATTTTTGTGATGGCAACTCTGATTCTCTTATTT 2603 ail.seq 
5TI7 ATAAAATCAGAACCCTCATTTTTGTGATGGGAAGTCTGATTCTCTTATTT 18rs21 ail set* 

5949 -ATAAAATCAGAACCCTCATTTTTGTGATGGGAACTCTGATTCTCTTATTT cohHlT ^ 

^50 ^Z^**^^^^^^^^^^^^^^^^"^^^^^A^^^^AACTCTGATTCTCTTATTT nLsnGtiK^ 

5950 ATAAAATCAGAACCCTCATTTTTGTGATGGGAAGTCTGATTCTCTTATTT a909_ail.seq^ 

CCGATTGTGAGCCAGGTAA GTTACTACCTTGCTTCCCATCAAAATATTAA Majority 



5998 
5767 
5999 



6010 6020 6030 6040 6050



A I T 5 TCAGCCACGTAACTTACTACCTTGCTTCG ^ 2603 aU.seq 
rr?^^^^«^ CCAGGTAAGTTACTACCTTGC ^ TCGCATCAAAATAT TAA 18rsil ailseq 
^^ A ^5 TGAGCCAGGTAAGTTACTACCTTGCTTCGCATCAAAA T all.seq 
CCGATTGTGAGCCAGGT.AAGTTACTACCTTGCTTCGCA-TCAAAATATTAA cfblli all sea 

5000 CCGATTGTGAGCCAGGTAAGTTACTACCTTGCTTCGCATCAAAAT AT T A A a909_a7i 

TC.AATTTAAGCGGGAAGTC GCTAACATTGATACTAATACGGTTGAACGAC Majority 
6060 6070 6080 6090 6100

5048 TCAATTTAACCGGCAAGTCGCTA.AGATTGATA.CTAATACGCTTGAACGAC 2603 all sea 
£\l l^^lll^ AGCCGG AAGTCG ^^ ^^^^^^<^^^ ^CTAAT.ACGGTTC AKCG AC 18rs2~l aiCeq 
^ ™ A A T T T A A G C C C C A A G T C G C T A A G A T T G A T A C T A A T A C C C T T C A A C G A C^^ all.seq 

TaATTTA.ACCGGGAACTCGCTAACATTCATACTAAT.ACCCTTGAACCAC cjblll ali!seq 
£fi J f ATTTA AGCGGCAACTCGCTAACATTGATACTAATACGCTTGAACUC «e*316~ail .seq 
1050 TCAATTTAAGCGGGAAGTCGCTAAGATTGATACTAATACGGTTGAACGAC a909_aTl^seq 

CCATCGCTTTAGCTAATGC TTACAATCACACCTTATCAAGGAATCCCTTC Majority 
6110 6120 6130 6140 6150

5?! 5^ TCGCT TTAGCTAATCCTTACAATGAGACGTTATCAAGGAATCCCTTG 2603 ail seq 

Sfl GGA IS GGTTTAGCTAATGCTTAeAA ^ TGAGACCt ISrsil ailseq 

»a S S f J G G CTTTAGC.TAATGCTTAC.AATGAGACGXTATCAAGGAATCCC.TTG coh I all.seq 

^? G G A T C. C T TT A G C T A A t C C t T A C A A T G A G A C G T T A T C A A G G A A T C C C T T G cjblll ailseq 

J™ G G A 3 G G G I T T A G C T A A T G C T T A C A A T G A G A C G T T A T C A- A G G A A T C C C T T G ne*3i6~ail .s^ 

51O0 G CA TCGC.TTTA G-C TAATCCTTACAATGAGACGTTATCAAGGAATCCCTTG a909_a7l.seq 

C T T A T. A GA.C CCTTTTACCA GTAA GCAAAAAGAAGC T"T TGA.CAGAGTATGC. Majority 

6160 6170 6180 6190 6200

Jf? CTTATAGACCCTTTTACCAGTAAGCAAAAAGAAGGTTTGAGAGAGTATGC 2603 all.seq 

?il ^TATACACCCTTTTACCAGTAAGCAAAAAGAAGGTTTGAGAGAGTATGC 18rs2dailTseq 

149 CTTATAGACCCTTTTACCAGTAAGCAAAAAGAAGGTTTGAGAGAGTATGCcohlail seq 
!f? ^I TATAGACCCT TTTACCAGTAAGCAAAAAGAAGGfTTGAGAGAGTATGC cjblTl aliTseq 
151 CTTATAGACCCTTTTACCAGTAAGCAAAAAGA A'GGTTTGAGAGAGTATGC ne*316~ail seq 

150 CTTATAGACCCTTTTA CCA GT A A CCA A A A AG A AGGT TT G A.G AG A G T A TCC.d909_ail.seq 

1C CGTATGCTTC A AG.TTCATG A GCAAATAG GTCATGTGG CA AT CCCAAGTA Majority 

6210 6220 6230 6240 6250

™* I G G I A T G C G A AGt TCATG A GC A A ATA G.GT C A T GT fc'G AT CC'C A AG T A 2603 ail^eq 
*Z J*** A TGC TTG AA l G ? T?; AT GA G C A f A.A T AG G V T C:*A T G. X'&.G C A- A T -CC - C • A A £. T "A: ■ iBrsSi^afllseq 
55-2 I T ■ TTG A A C.TT.C ATGA CCA A A T : A G G kX G A JT.fi T,GG ^G,A;^C fee^ G' T. A coiiiLal 1 - seq — 
199 TCGTAT G C TTG A AG T T C A T G A G C A A A T A G G T ii AsT >fe*. n'r ; * _-*J* ^-.'^SZ^^^I-JL 



T.G 

20 1 T C"G T A T G C T T G A 
2O0 T C'G T A T G S T. T ?G A 



T T G C C G TT C A-T'Af -T. .C C^A^A -TAT GC .*r&;jfo4^AV^ 

6260 6270 6280 6290 6300

T ?*C.C G G T T G AT AT TCCAA TTT A T G C;T G <J A A C A T C C G A A A £ TC^G 5c r T C A G 2603 _il.seq 
11 I It 5 5* TTGATA T T C C A A ^ TT A T G C T.G G A AC A T CC fe A.A A C TG tVC T T C AG I8r_ii ail^eq 
Itl I J G G G G T T G A T A T T C C A A T T T A T C C T C C A A C A T C C G A A A C X 0 T G C T T.C A G cohl aTllseq- 

249 T T G C.G G T T G A T A T T C C A A T T T A T G C T G G A A C A T.C C G A A A C T G T G C T T C A G cjblll aiTseq 
ll\ |? GGGG TTG A TATTCCA ATTT ATG.CTCC A AC ATCCG A A ACTGTGCTTC AG nem316""aU.^ 

250 TKGGGTT GAT ATT.C.GAATTTATG.CTCGA AC ATCCG AA AC TGTG C"T T C A G a909 all seq^ 
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tuc " Kteggjc^dU !fc:iirli i ilfc K ,A-Q^ ggjgj 'jul^ tctt cc actggg agctttgtc ^ 

6310 6320 6330 6340 6350

6298 AAA CCTACTCCCCATTTCCAGGGAACCACTCTTCCAGTC CGACCTTTrTr ?«m ,-. 
60S? AAAGGTACTGGGCATTTGGAGGGAACCAGTCTTCCAGTGGGAGGTTTGTC l^T if 
4,299 A A A C G T A C T G G G C A T T T-G G A G G G A A C C A G T C T T C C AG TGGG AGGTTTGTC Ilf aTl L^** 

6299 AAAGCTACTGGGCATTTGGAGGGAACCAGTCTTCCAGTGGGAGGTTTGTr .7* 
6301 AAAGGTAGTGGGCATTTGGAGGGAACCAGTCTTCCAGTGGGAGrTTTrTO J h ^ 

6300 AAAGGTAGTGGGCATTTGGAGGGAACC^GTCTTCC^G^GGG^GG^TGTC S.'^ 

AACCCATTCAGTA.CTAACTGC CCACCGTGGCTTCCCAACACCTACGCTAT Malorlrv 
6360 6370 6380 6390 6400

6348 AACCCATTCAGTACTAACTGCCCACCGTGGCTTGCCAACAGCTAGGCTAT 2<aw »ll 

6117 AACCCATTCAGTACTAACTGCCCACCGTGGCTTGCCAACAG^CTAGGCTAT ttr^i nif^L, 

22 AACCCr^CAG?ACT^c?^^ A ^ CGTCGCTTGC ^^ 

t>J4a AACCCATTCAGTACTAACTGCCCACCCTCCCTfGCCAACACCTAfirrTAT H hin ^ jT 

63* AACCCATTCAGTACTAACT^r nem316Zai 1 *seq 

6350 AACCCATTCAGTACTAACTGCCCACCGTGGCTTGCCAACAGCTAGGCTAT a909_.all.seq 

TTACCCACTTAAATAAAGTTAAAAAACCC CACATTTTCTATGTCACCAAC Majority 
6410 6420 6430 6440 6450

6^9 ^ACCrl C ^T^ AA ^ AAAC ^ AAAAAAC C C C A GATTTTCTATGTG.ACGAAC 18r S2 l_ a i l^eq 
bd99 1TACCGACTTAAATAAAGTTAAAAAAGGCGACATTTTCTATGTGACCAAC <-ihii 1 =i7^L, 
B4GO TTACCGACTTAAATAAAGTTAAA.AAAGGCCAGATTTTCTATGTCACCAAC a909_ail.seq 
A.T CAACCAAACA'CTTCCCTAC'AA AGTC GY G T CTATCAAAGTTGTCCATCC Majority 

6460 6470 6480 6490 6500

5448 ATCAACGAAACACTTGCCTAqAAAGTCGTGTCTATCAAAGtTGTGGATrr ran *ii <™ 

B«9 A TC A aS G A £ Ic AC . T T 6-C C-T AC A' A 'A GTC G T CT A T C A A AG T t C T G G A T C C 

6451 AT C A.ABG A A A C A C T"T CCCTACAAACTCGTGTCTAICAAAGTTGTCGATGC ^6^'^ 

6450 ATCAAGCAAACACTTGCCTACAAAGTCGTGTCTATCA-AAGTTGTGGATCC a^al,"^ 

AACAGCTTTAAGTCACGTTAAGATTGTCA ATCCTAAGGATTATATAACCT Majority 
6510 6520 6530 6540 6550

~7 AAC l CC inf AG ^ GAGCTTAAGATTGTCAATGCT AA«GATTATATAACCT 2603 all.seq 

5267 AACACCTTTAAGTGAGGTTAAGATTGTCAATGGTAACGATTATATAArCT IR.-^! = 

5499 A.ACAGCTTTAAGTGAGCTTAAGATTGTCAATGGTAACGATTATATA : ACrT ™£f 7. ^ 

5499 AACAGGiTTAAGTGACCITAA'GATTGTCAATCGIAACGATTATATAAC P-T ^hJlf ^ 
5501 AACAGCTTTAAGT.GAGGTT.AACATTGi CAA^GGTAACGA??^^ 

5500 AACAGCTTTAAGIGAGGITAAfiATTGTCAAfGGTAAGGATTATATAACGt 2S5S!^ 

TCCTGACTTCCACACCTTAC A T C A T C A A T A.G T C A T C C T C T C T T G C T A A A A Majority 
6560 6570 6580 6590 6600

^ I J C T G A GT T C C A C AC G fT A C A T G A T C lA^ G T C A T C.C T C T.C T T 6 G f A A A A 2603 ail sea 
317 T GC TGACT;TGC A € ACCj.TAGA T.G A T C A AT A G T C A t C G T C T & T t G C * -A A A r A i^rliHL 

X G € T G A C T T. G C A C . A- CC T C AH C A T G A A T A a t r >_ «n « * ^ ^ i^.t V 2 

1S$L : T-GX T 
1550 vTitrC T 



6610 6620 6630 6640 6650

1598 GGACA G C G f A T T CC T T A T G A T TC T A CC G A G G C G G A A A A G C AC A A AC a a rl^M « 3 t 

367 C.G.A G AGCGTAT T C C T T A T.G A T TCTAGC G A GdC.GG i A* A A GC A C A A A cAck SfliHeo 

^oo ? ^ ^ S ^ CGTATT C c T T A T G A T T C T A C C G A G G C G G A A A A G G A C A A AG A A C A 2f ali 

^9 G G- AG AGCGTAT T.C CTTATGAT.TCTACCGAG GC GGAAAAG'CACAAAGAAGA cj^ lT? ail^ea 

Ml C G'AG A G CG.TATTCCTTATGATTOTA CCGAGGCGGAA AAGC ACAAATAATA Sis ii"^ 1 

***>' G G A. G A G C G T A T T C C T T A T G AT TC, A C C G.A G G- C GO A A A A G C A t A A AG AACA K^i^T 
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. A A C C G T fej^lSgHLSBS g gjggj!^ illCTTCAAGATACTACTACTAT Majority 

6660 6670 6680 6690 6700

^ ^CCGTACAAGATTATCGTTTGTCACTACTGTTGAACAtACTACTAGTAT 2603 ail.seq 
£2 !^^ TACAAGATTATCGTTTCTCACTAGTGTTGA AGATACTACTAGTAT ISrsil ail seq 

ti. TA CAAGATTATCGTTTGTCACTACTCTTCAACATACTACTAGTAT cjblll ailleq 

AA CTACAAC ATTATCGTTTGTCACTAGTCTTCAAGATACTACTAGTAT nLlufail 
,650 AACCGTACAAGATTATCGTTTGTCACTAGTGTTGAAGATACTACTAGTAT a909^7l,seq 

TATTAATTGGACTCTTCATCCTGATAATGATGAGAAGATCGATGCAACAT Majority
6710 6720 6730 6740 6750

6698 TATTAATTCGACTCTTCATCGTCATAATGATGAGAAGATGGATGCAACAT 2603_ail.seq
6467 TATTAATTGGACTCTTCATCGTGATAATGATGAGAACATGGATCCAACAT 18rs21_ail.seq
6699 TATTAATTGGACTCTTCATCCTGATAATGATGAGAAGATCGATGCAACAT coh1_ail.seq
6699 TATTAATTGGACTCTTCATCGTGATAATGATGAGAAGATGGATGCAACAT cjb111_ail.seq
6701 TATTAATTGGACTCTTCATCGTGATAATGATGAGAAGATGGATGCAACAT nem316_ail.seq
6700 TATTAATTGGACTCTTCATCGTGATAATGATGAGAAGATGGATGCAACAT a909_ail.seq



CCTCAATAACGATGTTGT GAATGGCTTACTTACTTATCAAATACGTGACT Majority 

6760 6770 6780 6790 6800



74S CGTCA AT AACC ATGTTGTG A A TGGCT1ACI BB 1 I A H T A G GT C A C T 2603 ail seq 

517 CGTCA ATA ACG ATGTTGTG A ATGGCTTACTTACTTATCA AAT ACGTGA CT 18rs21 all seq 

749 CGTCAATAACGATGTTGTGAATGGCTTACTTACTTATCAAATAGGTGACT cohl all seq 

749 CGTCAATAACGATGT TCT G A ATCGCTTACTTACTTATCA AATACGTC ACT cjblll allseq 
751 CGTG AAT A ACGATGTTGTGAATGGCTTACTTACTTATCA AATAGGTGACT nem3l6~ail "seq 

750 C 6 T C A A T A A C G A T G T T G T G A A T G G C T T A C T T A C T T A T C A A A T A G G T C A C T a909_alt.seq 

AATC.ATG ATTGTGAATAAT GGT T A TCTAGAAGGCACAAAAATGAAAAAGA Majority 
6810 6820 6830 6840 6850






794 AATGATGATTGTGAATAATGGTTATCTAGAAGGGAGAAAAATGAAAAAGA 2603_ail seq 
567 A A T C A T C.A T T G T G A A T A A T G G T T A T C T A G A A G G C A C A A A A A T G A A A A A C A 18rs21 allseq 
7 J® AATGATGAT.TGTGAATAATG.GTTATCTAGAAGCCAGAAAAATGA AAA A G A cohl all. seq 

^? AA I5 ATGATTGTGAATAATGGTTATCTAGAAGGGAG ejblTl ailTseq 

™ A A 5 S ATG ATTGTGAAT AAT G'GTT AT.CT A.G A.AGGG A G* A A A A AT G A A'A'A A G A nem316~alKseq 
800 AAT G AT G ATTGTGAATAATGGT* T.* A TCTAGAAGGGAGAAAAATGAAAAAGA a909„a7l.seq 

GACAAAAAATATGGAGAGGGTTATCAGTTACTTTACTAATCCTGTCCCAA Majority
6860 6870 6880 6890 6900

6844 GACAAAAAATATGGAGAGGGTTATCAGTTACTTTACTAATCCTGTCCCAA 2603_ail.seq
6617 GAGAAAAAATATGGAGAGGGTTATCAGTTACTTTACTAATCCTGTCCCAA 18rs21_ail.seq
6849 GACAAAAAATATGGAGAGGGTTATCAGTTACTTTACTAATCCTGTCCCAA coh1_ail.seq
6849 GACAAAAAATATGGAGAGGGTTATCAGTTACTTTACTAATCCTGTCCCAA cjb111_ail.seq
6851 GACAAAAAATATGGAGAGGGTTATCAGTTACTTTACTAATCCTGTCCCAA nem316_ail.seq
6850 GACAAAAAATATGGAGAGGGTTATCAGTTACTTTACTAATCCTGTCCCAA a909_ail.seq

ATT C C ATTTGGT AT AT T G G T A.C A A GG T G, A A. AC C C A A G A TA.CCAATCAAGC Majority 

6910 6920 6930 6940 6950

»4 ATTCCATTT.GGT ATA T T G f T AG A ACG T G A A A CC C A AC A T A C C A A T C A A G C 2603 all.seq 
j67 ATT^J C A T T T G'G T. A T : A -T T. G G' T* A C. A A G:G T S G A I A A.C C C A AG A T A C C A A- T C* A A G C 18rs2l allVsea 
$ 9 #T.^VC,.C,A T.T T G G T„A TiT^.T A;'C A,A G G T G A A A ;C f C C A A CA T A C C A A T C A A G C cofil all.seq^ 
A K^J^T:C^-itA t|^^Ti^ : :fA,^ i Gix4*:'3ii^G-G-.A A* A T A C C-i-A-T.C A A G b cjbll* , al^eq 

«0 ArT-y.C C As TTT G^'T-ATAT T'G;,GT ACAA.GGTGAAACCCAAGATACCAATCAAGC a909_ail .seq^ 

• ^t^^^ AvG^A? C A A' T'G C^ A G.A C C A T T AG Malobitv- 

6960 6970 6980 6990 7000

144 A C T T G G A A A A C T A A TT G T T A A A A A A A C G G GAGA C AATG CT A C A G C ATT A G 2603 ail seq 
ri7. : A Cy T G G A A A A 0 T A A T T.GTt.A A AAA A ACGGGA G A C A A T G C T A CACC At TAG 18rs5i ail!seq 



A C TT G G A A A ACT AAT T„G T T AAA AAA ACGGGA G AC A AT G C T A C A CCA T TAG cohl all.seq 

ACTTGGAA.A AG TAATT C.TTAAAAA AACGGGAGAC AATGCT AC A C C A T T AG cjblll allseq 

)Sl A CT TvC'C AAAAGTAATTGTTA.AA A. AAA CGGGAG A "C AATG C TAG ACCATTAC nem316~ai 1 seq 

>50 A G T T G G A A A AG T A A T TGt T A AAA AAACCG gQ G A CMTGCTACACCATTAG a909 all seq 
Alignment Report of AI-1 alignment, using J. Hein method with Weighted residue weight table.

G C A A A G cfeic T l^TVa C"TT A A fc'"A A a""t"g AC A~" 



T G A T A ACTCACAAACAAGT Majority 
7010 7020 7030 7040 7050

yf^y^fyfy^TTTyr^ 2603_aii.seq 

52 ^p^^^^^-^^^^^-^]E^^^^^^*^^^^^^^^AATGATAAGTCAGAAACAAGT 18rsil ail^eq 

^AAAGCG ACTTTTGTGTTAAAAAATGACAATGATAAGTCAGAAACAAGTcohlaTl sea 
SS? ^AAAGCC^CTTTTGTGTTAAAAAATGACAATGATAAGTCAGAAACAAGT cjblU i^eq 

GCAAAGCGACTTTTCTGTTAAAAAATGACAATGATAAGTCAGAAACAAGT a909_all seq 



JQOt 

robo 



CACGAAACGGTAGAGGGTTCTGGAGAAGCAACCTTTGAAAACATAAAACC Majority
7060 7070 7080 7090 7100



2fi r* ^ r» r« ^ * ^ CTAGAGGGTTCTG GA GAAGCAACCTTTG.AAA A.CATA AAACC 2603 aiiisea 

Zl C AC GAAAGGC I AGAG ^ GTTCTGGAGAA C€AACCTTTGAAAACATAAAACC 18rsll_ail^eq 

Zt rArfAiirrrT«r*r^ TCTGGAGAAGCAACCTTTGAAAACATAAAACC cohL.aii.seq 

nl? GA J GAAAGGGTAGAGCGTTCTC GAGAAGCAACCTTTGAAAACATAAAACC cjblll alileo 

r0S0 C A CGAAACGGTAGAGGGTTCTGGAGAAGCAACCTTTGAAAACATA A A A C C a909 ail seq 



TGGAGACTACACATTAAGAGAAGAAACA GCACCAATTGCTTATAAAAA.AA Majority 
7110 7120 7130 7140 7150

m4 TGGAGACTACACA TTA AGAGAAGAAACAGCACCAATTGGTTATA.AAAAAA 2603 ail sea 

>867 TGGAGACTACACATTAAGAGA AG AAACAGCACCAA TT GGTTATAAAAAAA 18rs2I aiisea 

T099 TG'GACACTACACATTAAGAGAAGAAACAGCACCAATTGCTTATAAAAAAA cohf all s£ 

I G ^ AGAG I AGAGATTAAGAGAAGAAA CAGCACCAAT ^ ill aiHeq 

riOl TGCAGAC T A- CACATTAAGACAAGAA AC AGCACCAATTGGTTATAAAAAAA nem 3 i6~ail sea 

riOO TGGAGACTACACATTAAGACAACAAACAGCACCAATT GGTTATAAAAAAA a909_iall .seq 

CTGATAAAACCTGGAAACTT AAAGTTCCAGATAACGGAGCAACAATAATC Majority 
. 7160 7170 7180 7190 7200 

lilt GTGATAAAACCTGG AA.AGTTAAAGTTGCAGATAACGGAGCAACAATAATC 2603 ail. seq 

»!I CTG.ATAAAACCTGGAAAGTTAAAGTTGCAGATAACGGAGCAACAATAATC I8rs5l ail^eq 

149 C T G A T A.A A A C C T G C A A A G T T A A A G T T G C A G A T A A C G GAGCA ACAATAATC cohl all seq 

140 T.GATAAAACCTCCAAAGTTAAAGTTGCAGA TA A C G G AG CAACAATAA T C cjblll aifL, 

riSl CTGATAAAACCT G"G A A A G T T A A A G T T G C AG A T A A CGGAGCAACAATAATC n^l^a se^ 

TGATAAAACCTGGAA.ACTTAAAGTTGCACATAACGGAGCAACAATAATC a909_ai 1 ..seq 

GAGGCTATGGATCCAGATA AAGCAGAGAAACGAAAAGAAGTTfTGAATGC Majority 
7210 7220 7230 7240 7250

GAGGGTATGGATGCAGATAA AG CAGAGAAACGAAAAGAAGTTTTCAATGC 2603 ail.seq 
>967 GAGGGT A" TGGATGCAGATAAAG" CAGAGAAACGAAAAGAAGTTTTCAATGC 18rs21 aiileq 
199 GAGGGT A TGGATGCAGATAAAGCAGAGAAACGAAAAGAAGTTTTGAATGC cohl all sea 

GAGGGTATGGATGCACATAAAGCAGA C.AA ACGAAAAGAAGTTTTGAATGC cjblll ail seq 
^ GACGCTATGGATCCAGATAAACCAGAGAAACCAAAAGAAGTT TT C AATGC nLsis'ail sea 
rZOO CAGGGTATGGATGCAGATAAAGCAGAGAAACGAAAAGAAGTTTTGAATGC a909_a 1 1 . seq, 

C C A A T A T C C A A A A t C A G C T A T T T A T G A G G A T A C A A A A G A A A A T T A C C C A T Majority 
7260 7270 7280 7290 7300



riso c 



ri94 



^44. C G A A T A T C C A A A A T C A G .C T A T TTA T G. A G G A T A C A A A A G A A A ATT A G C CAT 2603 all seq 

I £ A *I A IS C . A A A ATCA-G GTA TXT At G A G.GATA.? A A A A G A A A AT T AC C- C T . IBrsIi aiHL, 
^249 CCA AT A.T G GA A A.A TC A G C TAT TTA T G AG G A TAG A A A AG A A A A T T A C CG AT cohl all seq 
>249 CCA A T;A Tj C C a A> A* C A;G G T> T/T T A T C A C G ATA C 'Avjft- A A O A A A A** t, T. , A C C G A . t cjlilil^a^eq 
*51 C CIA A -X A T G C A A A A T C A G-rT^TTt Af G Ar^G A'T^ C A A A A, GA A Ai ^A ifec A J i^SSii 

C C A A T A T CCA f AHTC A G/C T A.T T. T 'A T G AiG 'GV A T A : C A A A A, G A : a\TT A^ 'C* 'g^ T ^sWSt se<|- 






T T A A T G T- AG A G G CTTCC.A A AG TT G G t G A A G. A AT A C A A A G € A T T G A A ^ Matori^ 

7310 7320 7330 7340 7350

^294 T A G T T A A T G : T AG A GGGT t.C C A A A G TTG G T GA AC A AT AC A A AG C A T T G A. A T 2603 all seq 
'067 T A G T T A A T G T A G A G G G T T C C A A A G T T G G T G A A C A A T A C A A A G C A T T G A A T 18rs21 all seq 



^299 TAGTTAATGTAGAG G.C TTCCAAAGTTGCTGAACAATACAAAGCATTCAAT cohl all seq 
VS® TAGTTAATGTAGAGGGTTCCAAAGTTCGTGAACAATA C A A A G C AT T C A A T cjblll ail^eq 
f30J T A G T T A A T G : T A G A G G G T T C C A A A G T T G G T G A AC A ATACAAAGCA TT GA A T nem316~ail seq 
TAGTTA ATGTAGAGGGTTCC AAA'GTTGGTGAACA ATACA AAGGATT G A AT a909 aTl.seq 



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C C A AT A Mm gll Ifeg C A ~T~T~c CTCAAGGTTCGTTATC 

7360 7370 7380 7390 7400

EE "^^^^^^^^^"""'''^"""'"""""Ati 2ms-.Il.se, 
tt49 A J J I GCTCCAUACU4TUCTGSiC " Ti:,;TT *" allien 

ras! C^n?AA A Jcc AAAAGA £ GGTCCAAGAGAGAT ™^ ^Jbni aiHeq 

rafo c c A ^^ A A^^r GAAAA r A ^ GGTcGAAGAGAGATTGcTGAAGGTTGGTTAT c n.^ 

r3S0 CCAATAAATGGAAAAGATCCTCGAACACACATTGCTGAAGGTTGGTTATC ag09_ a 7l.seq 

AAAAAAAATTACAGCGGTCAATGATCTCC A TAACAATAAATATAAAATTr. MaJorHy 
7410 7420 7430 7440 7450

lifr AAAA A a A ^tta^^^ GGG ^ G GGA ^ AAGAA ^ AAA ^*^ AAA * t ^^ 2603 all sen 

'167 A A A A A A A ATT ACAGGGGTCAATGATCTCGATAAGAA T'A AATATA A A A *r T r laZvt i. 

^9 A A A A A A A A^ IlCirfffTfil^''*^''"'''*^* coh4 aTl ^seq 6 '' 

fJS9 AAAAAAAATTACAGCGGTCAATGATCTCGATAAGAATAAATATAA A a -r v r ~««,*7i 

nA A AAA A ^ ACAGGGGTCAAtpATCTCGA?AAGAA ^ AAA ^ 
400 *AAAAAAATTACAGGCGTCAAT.GATCTCCATAAGAATAAATATAAAATTG a 909_a7l "eq^ 

AATTAAC-TCTTGA CGCTAA AACCACTGTTGAAACGAAACA.ACTTAATCAA Majority
7460 7470 7480 7490 7500

«9 a^a"a c t^™ g a gg ^ aaaaccac ^ is^LaniL, 

.1=, . A II AAGTGTTGAGCGTAAAACCAC TCTTCAAACCAAAGAACTTAATCAA clblll ail^sea 

iso AA- T TAA- c T'cT'T^A GGG ? AAA i^^ s :s 

450 AAT.TAACTGTTGAGGGTAAAACCACTGTTCAAACGAAAGAACTTAATCAA a909_all.seq 

CCACTAGATC TCGTTCT GCTA.TTACATAATTCAAATACTATCAATAATGA Majority 
7510 7520 7530 7540 7550

^7 C C'A C T A p A ^pJS5JZ G J GG A TTCAAATACTATGAATAATGA 2603 ail.seq 

™I GGAGTAGATGTCGTTGTG CTATTAGATAATTCAAATAGTATGAATAATGA 18rs21 allien 

499 CCACTAGATCTCGTTGTCCTATTAGATAATTCAAATAGTATCAATAA-rrA . 7, ^ 
^ r r i r t A G A ^ G ^ GG ^^ GTGG ^ A TTAGATAATTCA A" A T A G T A T'G A A T A A'-T G A ctol7? allien 

M CCACTAG'ATGTCrTTrTrrTAXT^r »em316tjal 1 !seq 

500 CCACTA. GATGTCGTTCTGCTATTACATAATTCAAATAGTATGAATAATGA a909_all.seq 

AACA. CCCAATAATTCTCAAAC. AGCATT AAAACCTCGGGAACCA G T T C A A A Majority 
7560 7570 7580 7590 7600

5 *i "S* CCCAU A A T TCTCAAAGAGCATTAAAAGCTGGGGAACCAGTTGAAA 2603 all sea 
lil AAGACCCAATAATTCTCAAAGA CCATTAAAAGCTGGGCAACCXGTTGAAA Slaifseo 

S? AACACCCAATAATTCTCAAAGAGGA TTAAAAGCTGGGGAACCAGTTCAAA c^b 7i alHL 
^o A «"**TAAITCICAAACA(;.CAITAAAACCTCCGGAACCACTI AAA ne*316~a U*2 

550 AAGAGCCAATAATTCTCAAAGAGCATTAAAAGCTCGGGAACCAGTTGAAA aoSl.seT 

ACCTCATTGATAAAATTACA TCAAATAAACACAATAGAC.TACCTCT.TCTC Majority ' 

7610 7620 7630 7640 7650

SI * G G I G A T T G A T A A A A T T A C A T.C.A A A T AAA G A C A A T A C A C T A G C T C T T 61 G 2603 all sea 
ST V 5 C *•« A T T G A T A A A A TT A C A T C A A A T A A AG AC A AT AC A G T A C t T C T T G J-fc^rlifLq 

601 A G G .T G A T T G A T A A A 
BOO A G C.T G A T TIG AT A A,A 

A C A T AT C C C T C A A C C A T T T T T C A T G C f A C T G.A A.Ci CJC A Ci CrlG.T A t' C A A A G 6 C .HAlorltv 
7660 7670 7680 7690 7700

S£ A r FITj Si'Sj-J A A C C A T T T T T G A T G G T.,A C T G A AG C G A p CCIATC A A A G G G 2603: alKseq 
417 ACATATGCCTCAACCATTTTTCATGCTACTGAAGCGACCC TAT C A A A G C G lRi^l »s7\L. 
B4| A Ci T AT C C CTC A A p C ATT T TTC ATGCTA C.I C A AC CCA tC CIAT CA "A A GG G ^Srt^ 

I™ ACA J A I GCC I CAACCATTTTTGA TGGTACTGAAGCGACCGTATCAAAGCG nUie'a if S 
S50 . ACATATCCCTCAACCAT TT.T.T C A T G C.T A C T G A.AC C GACCCTATCAAAGGG SaTi se^ 
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A C T T G C dR^fe A'Ayijigyij A' aS^C Wk 



ATCATACTGTATCATGCGATT Majority 
7710 7720 7730 7740 7750 



Sf AGTTGCCGA T C A A A A T G G T A A AC CG CT C A AT C A T A G TCT A T C A T G G C A T 1 2603 ail seq 
It™ TTGCCGATCAAAATCGTAAACCGCTCAATGATAGTGTATCATGCGATT ISrsIl ah^eq 

^ " TGCCCATC AAAATGGTAAAGCGCTGAATGATAGTGTATCATGGGATTcohuil seq 

II CCGATCAAAATCGTAAACCGCTCAAT ^ TA CTCTATCATG.GGATT cjblTi ail^eq 
T701 ACTTGCCCATCAAAATGGTAAACCGCTGAATGATAGTGTATCATGCGATT ne*316~ail s2 
F7O0 A G T T G C C G A T C A A A A T G G T A A A G C G C T G A A T G A T A C T C T A T C A T G G G A T T a909_aTi.seq 

ATCATAAAACTACTTTTA CACCAACTACACATAATTACAGTTATTTAAAT Majority 

7760 7770 7780 7790 7800

Zlt "M'S ATAAAACTACTTTTACAGCAACTA C'A C A T'A A T T A C A G T T ATTTAA AT 2603 ail.seq 
SI "!I ! AACnCTTTTACAGCAACTACACATAAT TACACTTATTTAAAT 18r 5 2~l ailTseq 

Z*l *ZCAT.AAAACTACTTTTACAGCAACTACACATAATTACAGTTATTTAAATcohl aTl.seq 

3f? ATCATAAAACTACTTTTACAGCAACTACACATAATTACAGTTATTTAAAT cjblTi aiHeq 

Ztn a^J^ ATAAAACTACTTTTACAGCAACTACACATA ATTACAGTTATTTAAAT ne*316~aU.se^ 

750 ATCATAAAACTACTTTTACAGCAACTACACATAATTACAGTTATTTAAAT a909_all.seq 

T.TAACAAATGATGCTAACGAAGTTAATATTCTAAAGTCAAGAATTCCAAA Majority 

7810 7820 7830 7840 7850

794. TTAACAAATGATGCTAACGAAGTTAATAfTCTAAAGTCAAGAATTCCAAA 2603 all seq 

™l TTAACAAATCA-TGCTAACCAAGTTAATATTCTAAAGTCAAGAATTCCAAA ISrsil all!seq ' 

799 TTAACAAATGATGCTAACGAAGTTAATATTCTAAAGTCAAGAATTCCAAAcohiaTl .seq 

799 TTAACAAATCATGCTAACGAAGTTAATATTCTAAAGTCAAGAATTCCAAA cjblTi aiTsea 

801 TTAACAAATGATGCTA AC GAAGTTAATATTCTA A A G T C A A G A A T TC C A A A neffl316*"ail seq 

BOO TTAACAAATGATGCTAACGAAGTTAATATTCTAAAGtCAAGAATTCCAAA a909_aTl.seq 

GGAAGCCGAGCA. TATAAATGGGGATCGCACCCTCTATCAATTTGCTGCGA. Majority 
7860 7870 7880 7890 7900



B44 CCAACCCCAGCATATAAATGGCGATCGCACCCTCTATCAATTTGCTGCGA 2603 ail seq 

617 CGAAGCGGAGCATATAAA'TGGGG AT CGCA CGCTCfATCAATTTGGTGCGA ISrsil ail sea 

849 G G A AG CGGAGCA-TATAAATGGGGATCGCA CGCTCTATCAATTTGCfGCGA cohl aTl sea 

849 GGAAGCGGAGGATATAAATGGGG.ATCGCACGCTCTATCAATTTGGTGCGA cfblH ailleq 
B51 GGAACCCGAGCATATAAATUGGATCGCACGCTCTATCAATTTCCTGCGA ne«316~ail"seq 

850 GGAAGCCGAGCA TATAAATGGGG A TCGC A. CGCTCTATCAATTTGGTGCGA a909^aTl.seq 

UTTTACTCAAAAAGCTC TAAT-CAAACCAAATGAAATTTTAGACACACAA. Majority 

7910 7920 7930 7940 7950

894 CATTTACTCAAAAAGCTCTAATGAAAGCAAATG A. AATTTTAG AC AC ACA A 2603 ail.seq 
667 CATTT, AC.TCAAAAAGCTCTAATGAAAGCAAATGAAATTTTAGAGA.CACAA 18rs21 aiTTseq 

5^III A ^ TCAAAAAGCTCTAATGA AAGCAAATGAAATTTTACAGACACAAcohi aTl.seq 

899 C A T T T ACTCAAAAAGCTCTAATGAAAGCAAATGAA. ATTTTAGAGACACAA cjblli ail!seq 
901 CATTTACTCAAAAA-GCTCTAATGAAAGCAAATGAAATTTTAGAGACACAA nea316~ail.seq 

900 CATTT.ACTCAAAAAGCTCTAATGAAAGCAAATGAAATTTTAGACACACAA a909_aTl.seq 

A. G T TC T A ATCCT A G A AAA A A ACTTATT'TT TC A C C TA ACTCATC.CT GVT C CX Majority 

7960 7970 7980 7990 8000

944 A G T T C T A A T G C T A G A A A A A. A ACTTAT T t.T t C AC G T A A C T,G AT GGTG T C C G 2603 ail seq - 
717 AGTTCTA AT G C T A G A A A A A A AC T TAT ffTTC A-C G T A A C T G'A T G C T- G: T,G .C C 18rs21 aljkseq 
949 ACT T C.T A A; T.G G T. -A. G * * * " £ A " - **" ** "* ™ - - * " * - - - ^- ^ ^ 

949 A G.T ;T C-T A A T G C T A G 
951 A& X T, C T . A A T'GrC jt A c ' 

950 A G T.T C T A A T G^G it A .G'-j 







8010 8020 8030 8040 8050

T AC G At G T C .TTAT G G C ATA A A T T T T A AT C C T T A T A T A T C A A C A T C T.T AC C 2603 ail seq 



?67 T A C G A T G T G T T A T G C C A T A A A T T t T A A T C C T T ATA T A T C A A C A T C T T * C C 18rs21 aillseq 

999 TACGATGTCTT A T.G C C A T A A ATTTT A A TCCTT AT A T A TC A AC A T CTT A C C cohl aTl.seq 

999 TA.CGA T.G TC T T-ATCCCAT AAATTTT AATCCTTATATATCAA-C ATCTTACC cjblTl ail!seq 

301 T A C G A TC T C.T T A T G C C A T A A A T T T T A A TC CTT AT ATATC AAC A T C T TACC nem316~ail.s2 

300 T A C G A T G T C T T A T G C C A T A A A T T TT A A T C C T T A T A T A T C A A CATCtTA C C a909 dT? seq 
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A A A A C C j p fflg g gSgTWS SlB ATACCAGATAGAAGTCC TATT Majority ' 



8060 8070 8080 8090 8100



5ff *M!^! GTTTAATTCTTTTTTAAATAAAiTAC CACATAGAACTCGTATi 2603 ail seq 

AAAA ^ A ^"AATTCTTTTTTAAATAAAATACCAGATAGAAGTGCTATT ISrsIl aiHeq 
«M9 AAAACCAGTTTAATTCTTTTTTAAATAAAATACCAGATAGAAGTGGTATT cohlall sea 
J049 AAAACCAGTTTAATTCTTTTTTAAATAAAATACCAGATAGAAGtGGTATT dm alHea 
1051 AAAACCACTTTAATTCTTTTTTAAATAAAATACCAGATACAAG T ^ " 

1050 AAAACCAGTTTAATTCTTTTTTAAATAAAATACCAGATAGAAGTGGTATT 1^9 all 

CTCCAAGAGGATTTTATAATC AATCGTCATGATTATCAAATAGTAAAAGG Majority 
8110 8120 8130 8140 8150

S£ ^J^ AAGAGGATTTTATAAT CAATGGTGATGATTATCAAATAGTAAAAGG 2603 ail.seq 

tnqq G ^^^ A ^ GAGGA 31^^^ AT AATCAATGGTGATGATTATCAAATAGTAAAAGG ISrsltJ !^eq 

Si^SMStSSflJI* A T AATC AATCGTGATGATTATCAAATACtAAAAGC cohl.aTl.seq 

»99 CTCCAAGAGGATTTTATAATCAATGGTGATGATTATCAAATAGTAAAAGG ciblTl all sea 

\\n CTCCAAGAGCATTTTATAATCAATGCTCATGATTATCAAATAG™^ 2S 6~a Use^ 

1100 CTCCAAGAGGATTTTATAATCAATGGTGATGATTATCAAATAGTAAAAGG a909_a7l.seq 

ACATGCAGAGAGTTTTAA ACTGTTTTCCGA-TAGAAAAGTTCCTGTTACTG Majority 
8160 8170 8180 8190 8200

AGATGGAGAGAGTTTTAAACTGTTTTCGGATAGAAAAGTTCCTGTTACTC 2603 ail seq 
AGATGGAGAG.AGTTTTAAACTGTTTTCGCAXACAAAAGTTCCTGTTACTG 18rs21 ail seq 
AGATGGAGAGAGTTTTAAACTGTTTTCGGATAGAAAACTTCCTGTTACTGcohlaTl seq 
AG ATGGAGAGAGTTTT A A ACTGTTTTCGGATA'G A AAAGTTCCTGTTACTG cjblll aHseq 
AGATCCAGAGAGTTTTAAACTGTTTTCGCATAGAAAAGTTCCTCTTACTG ne«316~ail "seq 
AGATGGAGAGAGTTTTAAACTGTTTTCGGATACAAAAGTTCCTGTTACTG a909_aU.seq 

GAG'GAACGACACAAGCA CCTTATCGAGTACCGCAAAATCAACTCTGTCTA Majority 
8210 8220 8230 8240 8250



(144 
$17 
1149 
1149 
tlSl 
150 



194 G A G G A A C C A C A C A A G C A C C T T A T C G A G T A C C G C A A A A T C A A C T C T C T G T A 2603 all. seq 

967 GAGGAACGA G AC.A AG C A.GCTTATCG AG T- A GCGCAAAATCAACTCTCTGTA ISrsil ailTseq 

VLl S A 5J AACGACA CAAGCAGCTTATCGAG.TACCGCAAAATCAACTCTCTGTAcohI all. seq 

™? $ AC $ AACGAC ACAAGCAGCTTATCCACTACCCCAAAATGAACTC.TCTGTA cjblll ailTseq 

201 CAGGAACGA-CACAAGCAGCTTATCGAGTACCGCAAAATCAACTCTCTGTA nem316~ail.seq 

200 GACGAACGACACAAGCAGCTTATCGAGTACCGCAAAATCAACT.CTCTGTA a909„a7l .seq 

ATGAGTAATGAGGGATAT CCAATTAATAGTGCA. TATATTTATCTCTATTG Majority 
8260 8270 8280 8290 8300

j*£ A T G A G T A A T G A G G G A T A T.G C A A T T A A T A G r G G A T A T A T T T A T .C T C T A T T G 2603 ail.seq 

1*1 A 5 AGTAATGAGGGATATGCAATTAA T A GTGGATATATTTATCTCTATTG 18rs21 aii!seq 

249 ATGAGTAATGAGGGATATGCAATTAATAGTGGATATATTTATCTGTATTG cohi all, sen 

249 ATGA GTAATGAG GGATATGCAATTAATAGTGGATATATTTAT'CTCTATTG cjblll ail!seq 
251 ATGAGTAATGAGGGATAT-GCAATTAATAGTGGATATATTTATCTCTATTG nemaie^ail .seq 

250 ATGACTAATGAGGGATATGCAATTAATAGTGGATATATTTATCTCTATTG a909_aTl.seq 

C A G A C A T T A C A A.CTGGCT C T A T C C ATT T G AT G C T A A G A C A: A A G A A A G T T T Majority 

8310 8320 8330 8340 8350

2 4 G A « A 9 A T T A C A A n C G G t : ci A T C C A T T T <3. j^f C.C T A A G A C A A A G' A A A G T T- T 2603 ail seq 
D67 GACAGAT.TAC A • A C. T *G IGx G; T'C.|:A^t4AlrT T .G- A C • T: A A G .A G A A A ; ^G ; A A A -G TTT 18rs21 all^eq 

^9 GAGA G A T T A C A; A C T ;G 6 G* $-&$ ; $: : &G : &J$^ TUT^ 0^0iC^t^A A-G A" £ .A^;A"lA?i 1 •'A :, 'AS^AV G* ^T'TvT' coTil^aTliseq^ 
299i GAG A G A T T A G; A.AvCy^ ti^A^^^ 
301 G A fG A G A T T A 0 .J! 
3QO G A C AG AT T • A C.. i 

. . £ T G G A A c G A A A^^4AT,c^^ ; ^ ,„ .y v 

8360 8370 8380 8390 8400

§44 C CAACG A A A G A AST C A A A AG T CAT.' G . G T G A G ' .G C A- A CA A ^ A T T A tWt'TT 2603 ail sea 
117 C.T G C A A C C A A A C A A A T C A A A A C T.C A.tG G T G A G C G A A C A A £ AT* AT A C T T T 18rs21 alCeq 
»1« O X G C A A C G A A A C A -A A T C A A A A C T C A T G G T G A G C C A A C A A C A T T A T A C T T T " cohl al 1 -ieq 
349. CTGCAACGAAACAAATCAAAACTCATGGTCAGCCA AC A ACATTATACTTT cjblll ail^eq 
^ I S C A - ° ° A A A C A A A T C A A A ACTCA T G G Tic AG C C.A AC AA'CATT A TACTt.T ne«316~aiKseq 
350 C TC C A A C C A A A C A A A T C A AAA C TC A T G GT GAG C C A AC A A CAT T A t AC T T T a909 all seq " 
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A A T G C A BE^ESHSl B5 y 'Ali^C&i^j 'RcATTTTTACTCTTCCCATTCC Maiorltv 1 ~ 

8410 8420 8430 8440 8450



Iftt AATGGAAATATAAGA CCTAAAGCTTATGACATTTTTACTCTTGGCATTCG 2603 all seq 

8167 AATGCAAATATAAGACCTAAAGGTTATGACATTTTTACTGTTCCGATTGC ISrsit a Hea 

3£ AA I^ AAATATAAGACCTAAAGGTTATGACATTTT TACTGTTGGCATTGC cSu 

8399 AATGGAAATATAAGACCTAAAGGTTATGACATTTTTACTGTTGGGATTGG clbUl "iHea 
8401 A A T G C A A A T A T A A C A C C T A A A C G T T A T G A C A TTT TT A C T G TT G C C A T+ G C nl»3 6~a "s2 

8400 AATCGAAATATAAGACCTAAAGGTTATGACATTTTTACTCTTGGCATTGG ^O^lT! l^T 

TCTAAACCCACATCCTCCT G CAACfCCTCTTCAACCTGAGAAATTTATCC Majority 
8460 8470 8480 8490 8500

^ I AAACGGAGATCCTGGTGCAACTCCTCTTGAAGCTGAGAAATTTATGC 2603 al 1 sea 

£la I G I AAACGCACATCCTGGTCCAA CTCCTCTTGAACCTGAGAAATTTA.TCC«,hl air^ 

t£l I G I AAACGGAGA J CG I GGTGCAAG TCCTCTTGAACCTCAGAAATTTATGC cJbin"lHeq 

l^l TCTAAACGCACATCCTGCTGCAACTCCTCTTGAACCTGAGAAATTTATCC ne*316 all.seq 

B450 TCTAAACGGAGATCCTGGTGCAACTCCTCTTCAAG'CTGAGAAATTTATGC a909_all.s^ 

A A T C K<k TATCAACT A A A A C A G A AA A T T A T A C T AATCTTCATCATACAAAT Majority 

8510 8520 8530 8540 8550

^7 AA 2: GAA I A T?AAGTAAAACACAAA.ATTATACTAATGTTGATGATACAAAT 2603_al 1 . seq 

. AA TCAATATCAAGTAAAACAGAAAATTATACT A A T G T T G AT G A T A C A A A T 18rs2I all sea 
ESS AATCAATATCAAGTAAAACAGAAAATTAT AC TAATGTTGATGATACAAAT coM in s£ 
4Sm A ^ZS AA J A T GAAG ^ AAAAGAG AAAATTATACTAAT'G"TTGATGATACAAAi U lll^eq 

^ A A ^w^ A ^ * f AAAAGAG AAAA'TTATACTAATCTTGAT.GATACAAAT ni^^alLs^ 
SS00 AATCAATATCAACTAAAACACAAAATTATACTAATCTTGATCATACAAAT aSWOjall .seq 

AAAATTTATCATCACCTAA ATAAATACTTTAAAACA AT TGTTGAGCAAAA. Majority 
8560 8570 8580 8590 8600

_ ' . _ _ . _ _ ■ -J-i — ' * I . 



»J* A A A A TT TA T G A T GA G C T A A A T A A A T A C T A A ACAATTGTTCAGGAAAA 2603 ail. seq 

«1I AAAA ^TT ATGATGACCTAAATA AATACTTTA AAACA ATTGTTGAGCAAAA ISrsil alt^eq 

^ A A A a £ I I * I S A "IS A C ° T A A > T A A A A C T T T A A A A C A A T T G T T G A G G A A A A cohl_a7l .seq^ 

£ki * AAA ^H A ^ A T G AGCTAAATAAA TACT.TT.AAAACAATTGTTGAGGAAAA cjblll allien 

AA f A n^ A ^ A ^ AGCTAAATAAATACTTTAAAACAATTGTTGAGGAAAA nenjSie'ail.sS 
*550 A A A A T TT A T G A T G AG C T AAA T A A ATA CTTTAAAACAATTGTT'GAGGAAAA a909._all.seq 

ACAT TCTA T TG TTGATGCAAATGTGACTGATCCTATGGGACAGATGATTC Majority 
8610 8620 8630 8640 8650

*H A I A l I S T A T T G T T G A T G G A A A T G T G ACT G A T C C T A T G G C A G A G A T G A T T G 2603 ail .seq 

Ar ATTr t i t t GTTGATGGAAATGTGAGTGA TCCT AT GGGAGAGATGATTG 1.8rs21 aHeq 
^ill^l^llil r GAT 9^ ^^^^T^^CTQ AT CCT AT GQ^ A GAQ XT C ATT C cdkt all.seq^ 

^ H Itl J TTGATGGAAATGTCACTCATCaATCGGAGACATGATTG cjblll ait^eq 

! fl ^r C I!II^ T ' ATGG A A ATGTG ACTG ATCCT AT GGGAGAGATGATTG nem31G"ail .s^ 

IG00 ACATTCTATTGTTGATGGAAATG TG A..C TGATC. CTATGG GA GA- GATGATTG a909_a7l. seq 

A A T T C C A A f T A A A A A A T G G T C A A A GT.TT X A C A C A T G A T G A TT A C C T T T T G Majority 

8660 8670 8680 8690 8700

Sfi f tt l t 5 A A T ^ ' A '- Kl ^ K -^'^'TP A A A G T;ii T ?T A jG A C A T G A T G A T T A C G T TTT G 2603 ail. seq 
V A J I S C A A T T AA MX f/AT C C T C A A- A G T T T T A C A G. A T G AT GA XT A €/G T T TT G I8rs5l ail^e 

J649 AAtTCCAAT 
J650 : ; A A^T-.T' ,; c7e * Ai7k: 



1694 
1467 
1699 
1699 
170 i 
1700 



,C T T G G A A A,.t .fe/A/f ^CjG^C ^A -C ^C -;A °A "T.T A A AAA A^ G^F G T .fe.,G G T C T/t ^G T G. A C € Majority 

8710 8720 8730 8740 8750

5-'? A A ^^ G >l^^ ,A :: G ^^*^t^ A. A A'A A'AT.'G GTGT G G C T CTl4 fC.G AGO 2603 ail. seq 
G T.J G G AAA x c A ^ c G G A G.t G A A T T A A A.A*A A T G GTGTG.G C T C T T G G T G G A C C 18rs2i ailseq 
5 J I 5 G a A AT C A T^G G C A C T G A A T T A A A A A AT G G TGTGGCTCTTG G T G Cf A C C cohl_ail seq 
? I In ^ A f a T G A T GG ° A G T G A A T T A A- A A A ATGGTGTGGC TCT f G G T G G A G C cjblll ail!sfeq 
1J- { J M«Z £•?"•£ A G T C A A T T A AAAAATGGTCTGGC.TCT TG GTGGACC ne^ieZail.se^ 
G T T G G A A A TG A T G G. C A G T C A A T T A A A A A A T G G T G T G GCTC T f G G t G G A C C a909 ail seq^ 
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Majority 



8744 
8517 
8749 
8749 
8751 
8750 



8794 
8567 
8799 
8799 
8801 
8800 



8844 
8617 
8849 
8849 
B851 
B850 



8800 



AAACAGTGATGGGCCAATTTTAAAACATGT TACAG T G A C T T A T G A T A * 
AACAGTGATCCCCGAATTTTAAAAGATCTTA T 

aaacXgtgatgggcga 

AAACAGTGATCCrrrAAT^T 

aaacagtgatgggggaattttaaaagatcttacagtgacttatgataaga 



2603_ail.seq 

18rs21_ail.seq

cohl_ail.seq 

cjblll_ail.seq 

nem316_ail.seq

a909_ail.seq 



CATCTCA AACCATCAA AATCAATCATTTCAACTTA GCAACTCCACAAAAA Majority 
8810 8820 8830 8840 8850

CATCTCAAACCATCAAAAT^AATCA ^U.seq 
C A t c *r f* a a Arc A*rr»A * aa^.^77 11UAA ^*TAGGAACTGGACAAAAA 18rs21 ail sea 

A i c 1 CAAACCATCAAAATCAATCATTTGAArTTAr , r > a n r t r r* . ^ * * a * . . T 

CAICTCiiACCATCAAAATCAAT i c ° ^7?"^ 

CATCTCA A ACCATCAAAATCAAICATTTCA TT GU U ! CJ * I"* ^ 

CAICTCAAACCATCAAAArCAATCATUuicnuUuJ^J^ii:^^ 

^±±1 T C T TACCTATG *TGTAC C TTTAAAACATAAn T ATATAACTA An A A Majority 



8860 8870 8880 8890 8900



CTACTTCTTACCTATGATCTArrT 2603_aU. Seq 
GTAGTTCTTArrTAT^AT/.-r* X* i** AAAAOATAACTA TATAAGTAACAA cohl_.ail.seq 

CTAGTTCTTACCTATGATGTACGTTTAAAAGATAA'CTATATAAGT a909^aM !seq C< ' 

ATTTTACA AT AC AAAT AATCG TACAACGCTAACTCCGAAGAGTCAA AA Ac Majority 



8910 8920 8930 8940 8950



8894 ATTT TACAATACAAATAATCGTACAACCCTAAC TCff AAMPTr i a a a , 

B667 A T T T T : A C A ATACA'AATAATCGT J J GA T • 2603 a U . seq 

5899 ATTTTACA-ATACAAATAATCCTAC . U u i8rs21 aiKseq 

5900 A ™ TTACAATA ^^ 



Majority 



J944 
J717 
*949 
5949 
3951 
5950 



5994 
J767 
J999 
$999 
KJOl- 

woo 



AACCAAATACTATTCGTGATT T CCCAATTCCCAAAATTCGTCATGTTCGT 

8960 8970 8980 8990 9000

AACCAAATA'CTATT'CGTGATT'TCCCAATTCCCAAAATTCGTGATfTTrrT t^n* , , 

A A C C A A A TA.CTATTCCTGATTTCCGAATTC Ci A TKC GAT G T^tl'V** 

AACCAA-ATACTATTCGTG. AT-TTCCCAATTCCCAAAATT'CCTGATG i£? "t ^ 

AACCAAATACTATTCGTCATTTG.CCAATTCCCAA A A TTCGTG'ATGTTCGT ^7?^?^ 

A A'C C A'A A'TA CTATT.C6ICATTTCCCA A'T'T CCCAAAATTCGT'CATrTTrrl J JI ^ 

AACCA.A.ATACTATTCGTC^ 



GAGTTTCCGGTACTAACCATCAGTAATCAGAACAAAATGGGTGAGGTTGA Majority
9010 9020 9030 9040 9050



C AG T T T C C G G f A. OTA A CC AT C A GT A A TC A GA AG A A A A T c'.C GTGA'GGTTr A ?rh<* 

GAG T T T C C G G T A "C ? A X "C C A T r A a a W A A A A IU oU A U$ I £ S^I^eo 
G'A ;t G T ?^ C CGG T A £ T A £ r c M ? k ^ A * ? * G AT A G A AAA -T G G GT.G AG G T T'G A : bohl^seq 

^■^!. T |^6;T ^P^ A:A A A T C G C T^G aVc?C T^T G ^Wooi^s^jr 

ATTf 'ATT A*,* TiA AT 1 A , A A 0:A C A A A^C-A T T G A G A A* C C C T T T T G C f, A C C T A Majority 

:■* • ' • . . 9060 9070 " ~~ - • ' - 

-1 
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KM4 
*817 
KM9 
KM9 
KJ5I 
K>50 



9080 



9090. 



9100 



A T TT ATT A A A t •? TA A T A A A r A Jf.fS.JJ S AC A ATCGCTTTTGGG A G C I A 2603.aU.seq 
A TIT At TAA A -C T T A A T A A AC A r A « A r A I J C A G A A T C C C T T. T T G G G A C C T A 18r S 21_aineq 

A TT T At? AAA r T T A JTliiririiiSf T C CCT TTT'GGGAG C'T A cjblll alHeq 

AT4^??-AAXG?f A A? A AA> Ar Ma S A I J ^ A C A A T C G C T T T T G G G A G C T A ^AeTaU.^ 
A. 1 1TATTAAAGTTAATAAAGACAAACATTG AG AATCGCTTTTGGGACCTA a909 all seq^ 

G T T T C gaj^EB gglSpg SSHi iS T C T C C G T A T A A C C A A T T T C T T Ma inrity 

9110 9120 9130 9140 9150

8^7 lcTTTr!!rTT^ f C !^ CUAtAUTTTT " UC CT "'* CC ""I"t 2603 all sen 

9099 AGTTTCAACTTCAGAT-AGAAAAAGATTTTTCTGCCTATAAGCAATTTCTT ^ihlTi ^iTZ^ 

9100 ACTTTCAACTTCACATACAAAAAGATTTTTCTGGCTATAACCAATTTCTT aaog_.alI.seq 

CCACACCGAACTCATCTTACA A CAAACAATGATGGTAAAATTTATTTTAA Majority 

9160 9170 9180 9190 9200

9144 CCA CAGGGAACTCATGTTACAACAAAGAATGATCGTAA AATTTATTTTAt " ,, 

BI49 CCAGACGGAACTGATGTTACAACAAAGAATGATGGTAAAATTTATTTTAA J,M ifl seT^ 

BU9 CCAGAGCGAACTGATCTTACAACAAAGAATGATGGTAAAATTTATTTTAA ^bllT llTL, 

9151 CCAGAGG'GAAGTGA. TGT TA CAACAAACAATGATCGTAAAATTTATTTTAA 

9 «50 CCACAGGBAAGTGATCTTACAACAAAGAAICAIGGTAAAATTT'ATITTAA 

AG CACTTCAA. GATC GTAACTATAAA T T ATATGAAATTTCAACTCCAGATG Majority 
9210 9220 9230 9240 9250
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>194 

3967 



- , 1 | — " ' . »au 

ACCACTTCAA'CATfTT 4 A SZ A !E AAA TTATATGAAATTTCAAGTCCAGATG 2603_all.seq 

H99 AG^CA.CTTCAAGATGGTAACTATAAATTATATGAAATTTCAAC?cCArATr ^ 7?" ^ 
H99 AGCACTTGAACATCCIAACTATAAATTAIAK U^cl GTCC G Jr ^ 7? if 

£_ AGCAC ^? CAACA ? CG in C ? ATAAATT * ?A?GAAA ^ «^6^ :^ 

)200 AGCACTTCAA. CATCGTAACT.ATAAATTATATGAAATTTCAAGTCCAGATG a909_all.seq 

CCTATATA CACGTTAA AACGAAACdTGTTGTGACATTTACAATtCAAAAT Majority 

9260 9270 9280 9290 9300

1244 C C T A T ATA CAGGTTAAAAC-GAAACCTCTTGTGACATTTACAAT TCAAAAT 2S03 all *u*n 

1249 CCTATATAGAGftTT A A A' A r r A A A ^*'^^^*^'?"'*^^^*^*^Jc A A A AT' 1^8rs2Lail^eq 
Sjg C C ? t I A T A ? ACG II AAAACGAAAC CTCTTGTGAGATTTACAATTCAAAAT cohl_ail.seq 
IZ49 GCTATATACAGGTTAAAACGAAACCTGTTGTGACATTTACAATTCAAAAT clblll all-Li 
^0 CCTATAT AiACGTTA 1 A' A G G A A A G G ^ G ^ T G T G A C A ^-T T AC A A T T C A A A A T nea&16Zail!seq 

1250 GCTATATAGAGGTTAAAAGGAAACCTGTTCTGACATT T A C A ATT C.A A A A T a909_.ail.seq 

C C A G A A G T T A C G A A C C T G A A A G C A-G ATC C AAAT.CCTAATAAAA AT C A A A T 'Majority 

9310 9320 9330 9340 9350

294 GGAGAAtJTTACGAACCTGAAACCAGATCCAAA T.G CTA'ATAAAAATCAAAT 2603 all sea 
2I9 f r AA r^ A ^!^J TCAAACCACATCCAAAI « T "" 4A * A " A fS"n!1ea 
I99 GGAGAArTTArrAArrTrA AA r^'^ GATGGAAA ^ GG ^ AA ^ A ^ cohl_aTl .seq 

GGAGAAG II ACGAACCTGAAAGCAGATCCA AATGCTAATAAAAATCAAAr cjbUl allTseq 

300 GCAGAArTTArrA A rrTr AAA !*^ AGA ^ GGAA n^ieZaH.se^ 
300 GGAPAAGTTAC.GAACCTGAAAGCAGATCCAAATGGTAATAAAAATCAAAT a909_ail.seq 

C G C G T A T C T J G A A G G A A A T C C T A A A C A T C T T A T T A C C A A C A C T C C C A A A C Malor.tv 

9360 9370 9380 9390 9400

.W ° 'ft% I A I S 1 3 G C .A A A T'C.C.TA AtfC AICTIAITACCA A C A C T C C C A A A C 2603 aU.seq 



349 
JS0 




GCCC ACC AC CTCT TT IT'C ETA A A AC AvC' G G ..C1C.: A' A T T C G T A A AT T. G T .CI TAT Majority

9410 9420 9430 9440 9450

» 4 C C CCA c C A GP-T-GTT TTTCCIAAA AC A G G G G G A A T T C G T A C A AT 1 C T CTAT 2603_aU,seq 
!«Z ' 5 ACC A G GJT GTTTTTCCTAAAACAGGGGGAATTG GT A C A AT ICTCTAT 18rs21 ail^eo 

»9 GG.C.CAC G A GG.TGTTTTTCCtA A A AC AGGGGCA ATTGCT AC'AATT.G TC T A T cohl all sea 
199 CCCCACCAGGTGTTTTTCCTAAAACAGGGCGAATtGbTACAATTGTCTA? e^llt "iHea 
C C C .C A G G A •« T «» TT T C C T.A A A A C A G G G G G A AT f G G T A C A A T T G T C T-A T ne«3 rail's^ 
C C.C C A C C A G C T G T T T T T C C T A A A A C.A G G G G G A.A T T G G TACAATTGTCTAT a909 all 



101 
100 

" " ^X^r,tt)m^m t A E£g g Ui TACC ATTTGTTCTTTCCGTCG 

9460 9470 9480 9490 9500



A T A T T 



J500 



9444 AT ATTAGTTGGTTCTACTTTTAT€ATACTTACCATTTCTTCT TTCCCTCG 26m »n _ 

MSI ATATTACTTGGTTCTACTTTTATGATACTTACCATTTGTTCTTTCCGTCG nJ£ tl* I !2 
J450 ATATTAGTTGGTTCTACTTTTATGATACTTACCATTTGTTCTTTCCGTCG a9^^aH !seq 

TAAACAATTGTAAGCTCTCGTTCAAAT T ATTAAATAATACAAAATCACTA Majority 
9510 9520 9530 9540 9550

\£i » 1 II G I AAGGTG TCGTTCAAATTATTAAATAATACAAAATCACTA2<503 all.seq 

^1 I AAA ^Mn^J AA ^ GTCTCGTTGAAATTATTAAA1 : AATAGAAAATC ACTA ISrsil a itseq 
wqq 5JiiSf"I""S" CTCCTUAU " l ""M*«i*i4iiTCiCTAa»l aU.se* 
isni J AA ^ G . AA Z^ G ^ AAG ?^ G ^ G< *^^ G AAATTATTAAATAATAGAAAATGACTA cjblll alHeq 
I AAAGAA II G I AACGTCTCGT TGAAATTATTAAATAATACAAAATCACTA nls^aU sS 
TAAACAATTGTAACGTGTCGTTGAAATTATTAAATA AT AGAAAATCACTA a909_ai"l.seq 

GJTTTCCCCTTTCCCTATTGTC ACTCAGATTACTTATTATCAACCTTCTC Majority 
9560 9570 9580 9590 9600

C TJ3I GGC ^ TTTCCCTATTGTCAGTCACATTACTTATTA TC AAG CTTCT 2603 all sen 
C .IIII GGCCTTTCCeTATTGTCA GTCACATTACTTATTATCAAGCTTCTC ISrsil a Hen 
GTTTTGGCCTTTecCTATTGTCAGTCAGATTAGTTATTATCAAGCTTCTC coM altseo^ 
G JIII CGCCTTTCCCTATTGTCAGTCACATTAGTTA TTATCAAGCTTCTC ojbma fseq 
?^ ^'99 ^ ^ J ^ T G c G T A j x G T. C A G T C A G A T T A G T T A T T A T C A A C C T T C T C neaalff all]seq 
GTTTTGGCC-TTTCCCTATTGTCAGTCACATTAGTTATTATCAAGCTTCTC a909 a7l s£ 



694 

367 



ATCCCAATATTAATCCCT TTAAAACAACACTTACAACCATTGACCGTACG Majority 
9610 9620 9630 9640 9650



^I GC . GAATA T" TAATGCC T ? TAAAAGAACACTTACAACCATTGA'CCGTACG 2603 ail seq 
- A I G G C A A I* I I A A T G C C T T T.A. A A A G A A C A G T.T ACAA'CCATTGACCGTACG ISrsil alHeq 
™ A T G CCAATATTAATGCCTTTAAAAGAACAGTTACAACCATTGACCGTACGcohlaTl sea 

^ A I G ^ CAA T A TTAATGCCTTTAAAAGAACAGTTACAACCA.TtGACCGTACG S«3 6~ail"seo 
600 ATCCCAATATTAATGCCTTTAAAAGAACAGTTACAACCATTCACCCTACG a9&9_aU s^T 

CAGATTCAACCACCCTTA CGTCTGGCTAAACCCTACAATCCTAGTATTTC Majority 
9660 9670 9680 9690 9700

!2 GAGA n CAACGACGCTTAGGTCTGGCTAA A GCCTACAATG CTAGTATTTC 2603 all.seq 
nil GAG ^n^ AACGA ^ GGTTACGTCTGGCTAAAGCCTACAATCCTAGTA TTTC ISrsil all^q 
MO GA r A n CAACGACGCTTAGGTCTGGCTAAAGCCTACAATGCTAG TATTTCcohl aTl.seq^ 
GAGATTCAACGACGGTTAGGTCT G G C TAAAGCCTA. CAATGCTAGTATTTC cjblll all^eq 
GAGA TTCAACGACGGTTACGTCT G.||c T A A AG C C T A C X A T G C T A G T A T T T C ne«3 6~a 
BS0 GAGATTC'AACCACGGTTA GGTC T G G C T A A AG C C T ACAAT'CCTAGTATTTC a^al!"^ . 

T C G A A C A A C T A CTCACTCCACTCA AT C T G T G C T G A C A G A T T C T T A T T C T G Majority 
9710 9720 9730 9740 9750



S94 
167 



I GG A A : C A A G I A : G T C A 6 T-C C.A CtCl AT C T G T GOT G A G AG A; TTCTTATTCTC 2603 all.seq 
.„ I 5 G f A C A A G T;A G T C.A G T CO AC T C J A J C T.G T C C T G AG A G A TTCTTA T T C T G isrsSl ali^seq 
5» I GGAACAAGTAGTC AGT G 6 AC T C A A T C T CT GC.TG A G A G ATT C TTATT C T G 1 coni^Jseq^ 
Ire ft r $ A G J v A S T 5 G C A ; A * G T. G T. G T -G. A<G; A.GA T- T!.C'T : T A.T/f ; C:T .G. cjbl7iUali^eti 

rOO T G G A A C A A G T A G T C A C, T G G A^*T, C A A T C T C T (? C T C A G iA .G A'T T.C T T A T T C T G •aflift.all seq 
A G G A C C A AA A A AC CCA C G G'-T C G A -C T C.A A T,,A CG C t A G C A TCT T A G A A G T C A Majority 
9760 9770 9780 9790 9800

-?v a G G » '"S G A A A A A A G G C A G G G T G C A .C T G A A T A C G C T A G G A T G' T T A G A A G T C A 2603 all.seq 

mo i SS^S S * * A AA AGGC AGGCTGG ACT C A ATA CGCTAGGATG T T AG A A CT C.A .18^21 allleq 

«2 A G G A GC A A a A A A G G C A G G G T C G A C T G. A A T A C G C T A G CAT G T TAG A ACT C A •cohl_ali.seq 

49 AGC'AGCAAAAAACGCAGG.GTCGAC.TGAATACGCT.AGCATGTTAGAAGTCA cjblll allsea 

50 A G C A C r A A A A A 1 S-S C A r 5 r'J £-S A G T C A A T A C G C T A G C A T G T T A G A A G T C A S ^ 
«0 A G.C A C C A A A A A A C G C A G C G T G G A C T G A AT AC CCI.ACCAI C T T A GAACTCA a909_ail. SC q 
WO 2006/078318 PCT/US2005/027239

Alignment Report of AI-1_alignment, using J. Hein method with Weighted residue weight table.

Thursday, July 29, 2004 5:46 PM

C A G A € C AgE j C4 ggyg W'A f^Tcfc-rf AAAATCAATCACCATTTACCA Majority 

9810 9820 9830 9840 9850 



>794 CAGAGCAGGTTGACCATGTGATGATTCCAAAAATCAATCAGGATTTACCA 2603_ail seq 

>567 GAGAGCAGGTTGACCATGTGATGATTCCAAAAATCAATCAGGATTTACCA 18rs2l ail seq 

>799 GAGAGCAGGTTGACCATCTGATCATTCCAAAAATCAATCAGGATTTACCA cohl_aU seq 

>799 GACAGCAGGTTGACCATGTGATGATTCCAAAAATCAATCAGGATTTACCA cjblll_aU seq 

W01 GAGAGCAGGTTGACCATGTGATGATTCCAAAAATCAATCAGGATTTACCA nem3l6_aii seq 

>800 GAGACCA GGTTGACCATGTCATGATTC'CAAA AATCAATCAGG ATTTACCA a909__ai 1 .seq 

ATCTA CCCTGGTTCAGAAGACCACAATCTGCAACGGGGAGTTGGTCATCT Majority 
9860 9870 9880 9890 9900

>844 ATCTACG CTGGTTCAGAA. GAGGACAATCTGCAACGGGGAGTTGGTCATCT 2603_ail seq 

K317 ATCTACGCTGGTTCAGAAGAGGACAATCTCCAACCG.GGAGTTGGTCATCT 18rs2l__ail seq 

>849 ATCTACGCTGGTTCAGAAG. AGGACAATCTGCAACGGGGAGTTGGTCATCT eohl_ail seq 

«49 A T C T A C G C T C G T T C A G A A G A G G A C A A T C T G C A A C G G C G A C T T C C T C AT C T cjblll_ail seq 

1851 ATC.TACG CTGG THC AGAACAGG ACAATCTCCAACCGCC ACTTGCTCATCT nem316_ail seq 

W50 ATCTACGCTGGTTCAGAAGAGGA C-A ATCTGCAACGGGGAGTTGGTCATCT a909_ail.seq 

AGAAGCGATAA GTTTGCCGATTGGAGGGGCTTCTACACATGCGGTCTTGA Majority 

9910 9920 9930 9940 9950 

I , — 1_ 



«94 AGAAGGGATAAGTTTGCCGATTCGAGGGGCTTCTACACATGCGGTCTTGA 2603_ail seq 

1667 AGAAGGGATAACTTTGCCGATTG G A GGGCCTTC.TACACATGCGGTCTTGA 18rs21 ail.seq 

«99 ACAAGGGATAACTTTCCCCATTGGAGGGGCTTCTACACATGCGGTCTTGA cohl.ail seq 

1899 AGAAGGGATAACTTTCCCGATTCGAGGGGCTTCTACACATGCGGTCTTGA cjbl il_ailseq 
©01 AGAAGGGATAAGTTTGCCGATTCGAGGGGCTTCTACACATGCGGTCTTGA nea3l6_ail seq 

1900 AGAAGGGATAAGTTTCCCGATTGGAGGGGCTTCTACACATGCG'GTCTTGA a909__ail .seq 

CCGGT CAAACAGCTATGCC AG CTCCTCGCTTCTTTCCG.GA.TT T.G CAT A AG Majority 
9960 9970 9980 9990 10000 



949 

1949 
951 



- : : : _-l : I i 

•944 GCGGTCAAACAGGTATCCCAGCTGCTCGGTTGTTTGCGGATTTCGATAAG 2603_ali seq 

•717 GCGGTCAAAGAGGTATGCCAGCTGCTCGGTTGTTTGCGGATTTGGATAAG 18rs2i_aii7seq 

-949 GCGCTCAAACAGGTA TC^CA CCTCCTCGGTTGTTTGCGGATTTGGAT AAC cohl_ail seq 

G.C GGTCAAAGAGGTAT GflC AGCTGCTCGGTT'GTTTGCGUTTTGCATAA G cjbl U_aii. seq 

GCGGTCAAAGAGGTATGGCACCTGCTCGCTTGTTTGCGGATTTG C A T A A G nem316_ail .seq 

GC GGTCAAAGAGGTAT cQ-C AGCTGCTCGGT T'G T TT GCGGATTTGGATAAG a909_aii.seq 

ATCAA AAAA.GGTGATTATTTTT-ATGTTACCAATCTGAAAGAAACCTTGGC Majority 
10010 10020 10030 10040 10050

994 ATGAAAAAAGGTGATTATTTTTATGTTACCAATCTGAAAGAAACCTTGGC 2603_ail.seq 
767 ATGAAAAAAGGTGATTATTTTTATGTTACCAATC. TGAAAGAAACCTTGGC 18rs21_ail.se 

999 A 
0001 A 
0000 A 

T TAT C AAGT.GCATCG.TA T C A T G G T G AT T G A A C C T A G C C A A TT C G ATGCCG Majority 

10060 10070 10080 10090 10100

0044 TT ATCA AGTGGA T.G G T A fCA-TGC T G A TT G AAC C T A G C C A A t T G G ATGCCG 2603_*il.seq 
817 T T A T C A A C T G G A T C G T A T C A T G Q T GAT % G. A A' G' .C T A G C C A A T G G A <T G GC G a8rs2tiall.seq 
0O49 TT A T C A A GT.fi G-A T'G G T A T C AT G G T G A T;T*G. A ^A-C C T"A G C.G A ; A IT T~rG G' Ar T G.G G GV i^ifca^^seq < 
D049 TT ATC A * " v * " - *" *" ~ " - ; ~ ~ ~ - ~ • - — * - - - .*.-■'_; : ~- ' " - 

D051 
D050 

. I frAG C kl £jg * Ji £ A^ A A ^teA : ;Tkt:A;C,CX t T*C T >.G A:G:^ 

10110 10120 10130 10140 10150

0094 T G A G CATT G A A G A G G ATA A' A G A tt A TGT T A C C CTTC T.G. A C C T GT AC A C C t ^603_ail,seq 
B67 T G A C C A T T.G A A G AG G AT A A A'G A TT A.TGTT AC C GTTGT G AC GTGT AC A G C t 18rs21_aiLseq 
D099 T G.A ^ CATT G.A A G A G G A T A A A G A T -T A t C T T A C C C T T C T G A C C T'G T ! A C A C "C T odhl£ill seq 
D099 TGA.GC AT TX A AG AGGATAAAGATTATGTTACCCTTCTGACCTGTACACCT cjbl ll_all. seq 
0101 T.G. A CCA TT C.A.A GAGGATAAAGATTATGTTACCCTTCTGACCTGTACACCT neaj316_aa,seq 
0100: TGAGCATTG A AG AG G AT A A AG AT T A TG'TT A C CCTTCT G A C CTCT A'C A'CCT a909_all.seq ' 
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„ • 10 . 160 J"™ »MfQ to'im : ^oo 

T A T A T C C G CTCTTTCTCnCTGT AAI^CCTCACTT^ 3603_aU.se., 

10149 TAIATCCGCTCT T or ATCAIIU!:UCtU cohl_aiI .seq^ 

1015. TATATCGCCTCTTTGTCA^c•^G^^^^ccc^^tr^^^^^ TTAACAACGAG c J"»-*»H2eq 

<°-TATATGCccTCTTTCTCA^.^ A \S T n^ 

AG.AGAATCAGCTTGGTTCTCTCT'CTTTTTGGATGTTC- A AACCGATGACAA 

10210 10220 10230 10240 10250

J0194 AG.-AGAATCAGC TTCCTTCTCTrT^TTTT T^^ . _ ^ ^ ; *— — — l- 

9967 AGiCAAICACCTTttTICICICTblTTTTrriTrTT^ 2603 - al 1 s ^ 

10,99 ACACAATCA6CTTGCTTCTCTCTCTTTTTrri L T?TT^^^'^ A * 18rs21.__U.seq 

10199 ACACAAICACCTIGCTTCTCTCICTTTTTfriTS^ cohl_al , .seq 

1020, AGACAATCACC??. 6? _C.C_C.C_J? _______ "^^ cJb„ t _aU .seq 

»ACACAATCA C CUG C Tr,TCTCTCt"^^I^I^: A :^:i^«: A «^ 

TTTTCCTTTTAAAATTTTTA AACT TCC C AAACCCAA Ar.CCTTCTnr.rTT_ ^ Jocity 

_____ , . »o?8Q JST~ : _____ 

10249 TTTTCCTTTTAAAATTTTTAA A r T T_> ppiii^,? I8i_2__a,,. se q 

s?... __.;;._ ; Will H:"""«J-»«J?«S"JfS sSu'irC 

n.riimn.mt,,,',,,,...,,,,,,,,,. . ,,,,,,,,,.,,, ^ 

____ 1350 .os<o „,_», 

s :«J5?J??i^^':j??:?tKj«s.?sii.?.jnsi-;j«.i__js_ r^-- ' 
:s _; _.i?fii_fi_:« ... ............ H!L s ii_. i ;f . ^"'^ 

1030, ATG'TCTTTGATAAGCTTCTTCCTCGCTTC T-A ATTTCGCATTr r « 1 _ r r (■ 1 c J b *»- a "«« 1 
1 0300.AT GXCXTTG . ATAAC . C . TT€TTCCTCCC ^ 

TAGCTC ACCCCCTTT GTGATGT A AGTTTT.CTACT TCATGA AGCTCCTA A A Majority 
._: 10370 ,038ft ,0390 i 

iomT^^oio^^^^ 

10349 T A G C T G A C G C C G T T T G TC A T GT A A C T T T TS TACT T 'C A T-C A iftl rrl f ! A > 8rs21 - a » 
KJ349 TAGCTCAGGGCGTTTGTGATGTAAGTTTTGTACTTCA *r r A A r r Tr J» ^ ^ seq 
035, TAGCTGACGGCCTTTGTGATGTAAGTTTTGTAbTTdATr Hf ZZrnZ^* c J bU1 - a » 
.0350 TAGCTGACGGC. GTTTGTCATGTAAGT IT TCTA t T T.C A T G A A G GT C C T A A A aSw^aTlUseq^*' 

C A C C A T T T T A AA A G C A C A A T T C C C A C t t c t t a c g tta tg a t C T A T G A A G r __,_■-,..■. 
._ iQfto; ■ ,10420 ; 10430 . ,oi4o . _o.so 

G A C C A T TT :-— - 
G A C C A 
0399 GA CC A 

0399 fi A C C A T : 
OiOl GA.C C A T 

0400 G A C C A f 



TAA^M_I.^_|_|_^ 
0449 T A,A A G A A T T C ACT T G * . fi t C _ t T t C T T i « » A J T r » _ f £ -f *5 $ _' 4T ^ f ° C X G *' V 1 *^*-*"-**, 

^ T ,A*A*AATTC.^ 
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GAGCTCATCTGAGAAAACTTTTATACCTCA Majority 



10510 10520 10530 10540 10550

10494 TAAAGGTCGTTATAATAGCCGACCTCATCTGACAAAACTTTTATACCTCA 2603_ai1.seq
10267 TAAAGGTCGTTATAATACCGCAGCTCATCTGAGAAAACTTTTATACCTCA 18rs21_ai1.seq
10499 TAAAGGTCGTTATAATAGCGGAGCTCATCTGAGAAAACTTTTATACCTCA coh1_ai1.seq
10499 TAAAGGTCCTTATAATAGCGGAGCTCATCTGAGAAAACTTTTATACCTCA cjb111_ai1.seq
10501 TAAAGCTCCTTATAATACCCGAGCTCATCTGACAAAACTTTTATACCTCA nem316_ai1.seq
10500 TAAAGCTCCTTATAATAGCGGAGCTCATCTGAGAAAACTTTTATACCTCA a909_ai1.seq

AAGTCA CTCTAGCTTTCATATCATCAAGCCATTACGAGTTATTCCTTATC Majority 
10560 10570 10580 1O590 10600 

IOS44 AA^TCACTCTAGCTTTCATATCATGAAGCCATTAGGAGTTATTCCTTATC 2603_ali.seq 
10317 A AGTCAGTCTAGCTTTGATATCATGAAGCCATT-AGGACTTATTCCTTATC 18rs2Uail^eq 
10549 AAGTCAGTCTAGCTTTGATATCATGAAGCCATTACGACTTATTCCTTATC coM_aiI.seq 
10549 AAGTCAGTCTAGCTTTGATATCATGAAGCC A.T T A G G A G T TATTCCTTATC cjblll_ailseq 
10551 AAGTCACTCTACCTTTGATATCATCAAGCCATTAGGAGTTATTCCTTATC ne^316_ail *seq 
10550- A AGTCAGTCTAGCTTTCATATCATGAACCCATTAGGACTTATTCCTTATC a909_ail.seq 

TTSTTAGTGGCCC GCCATCCATATACTGATAGATCCAGATATTTACATCCA Majority 

10610 10620 10630 10640 10650

10594 TTTTAGTGGCGCGCGATCCATATAGTGATAGATCGAGATATTTAGATCCA 2603_ai1.seq
10367 TTTTACTGGCGCGCGATCCATATACTGATAGATCCAGATATTTAGATCCA 18rs21_ai1.seq
10599 TTTTAGTGGCGCGCGATCCATATAGTGATAGATCGACATATTTAGATCCA coh1_ai1.seq
10599 TTTTAGTGGCGCGCGATCCATATAGTGATAGATCGAGATATTTAGATCCA cjb111_ai1.seq
10601 TTTTAGTGGCGCGCGATCCATATAGTGATAGATCGAGATATTTAGATCCA nem316_ai1.seq
10600 TTTTAGTGGCGCGCGATCCATATAGTGATAGATCGAGATATTTAGATCCA a909_ai1.seq



AAAGTTCTATCATCCTCTTTTGCCGCCTTTTTTCCACCAGATAATATTAA Majority
10660 10670 10680 10690 10700

10644 AAAGTTCTATCATCCTCTTTTGGCGCCTTTTTTCCAGCAGATAATATTAA 2603_ai1.seq
10417 AAAGTTCTATCATCCTCTTTTGGCGCCTTTTTTCCAGCAGATAATATTAA 18rs21_ai1.seq
10649 AAAGTTCTATCATCCTCTTTTCGCGCCTTTTTTCCAGCAGATAATATTAA coh1_ai1.seq
10649 AAAGTTCTATCATCCTCTTTTGGCGCCTTTTTTCCAGCAGATAATATTAA cjb111_ai1.seq
10651 AAAGTTCTATCATCCTCTTTTGGCGCCTTTTTTCCAGCAGATAATATTAA nem316_ai1.seq
10650 AAAGTTCTATCATCCTCTTTTGGCGCGTTTTTTCCAGCAGATAATATTAA a909_ai1.seq

GCTACCTTGCTCTAACAACTCCAGCACTTTATTTACACCACCTATTAATC Majority
10710 10720 10730 10740 10750

10694 GGTAGCTTGGTCTAACAACTCCAGCACTTTATTTACACCACCTATTAATG 2603_ai1.seq
10467 GGTAGCTTGGTCTAACAACTCCAGCACTTTATTTACACCACCTATTAATG 18rs21_ai1.seq
10699 CCTAGCTTGGTCTAACAACTCCAGCACTTTATTTACACCACCTATTAATG coh1_ai1.seq
10699 GGTAGCTTGGTCTAACAACTCCAGCACTTTATTTACACCACCTATTAATG cjb111_ai1.seq
10701 GGTAGCTTGGTCTAACAACTCCAGCACTTTATTTACACCACCTATTAATG nem316_ai1.seq
10700 GGTAGCTTGGTCTAACAACTCCAGCACTTTATTTACAGCACCTATTAATC a909_ai1.seq

GAAA.CTACACGA GTG A G A TTC AAGGTAT. T G G C A C A A C G A T T AA G TCACAA Majority 

. ♦ J076Q 1Q77Q ' * ;1978P 10790 • 10800 

0744 C A A A C T A C A C € A C T G A C A T TGi A AG C T A T T G G 6 AC A A CC ATT A AG TC A € A A 2603^aU.seq 
0517 C AAA C T A C A C C A C J , A € A" T T: € Ji : . A Q ; C T A lt •T:'G;',G-j&''- : A C A A ~C .,G- A. T A ' A G T C A C A A * I8'rs21_ail seq 
0749 C A A A C T A C A G C ^p^A ; fi AT T^.G ; Ay2L G >C T : A . T .'. jG;'G^' G .A- C A A £ G A T T A -. A G T C AC A A cbhl^iilseq 
VfA9 G.&p* : -C'%&&X C A A. cJMU-al*i.s^b ' 

0751 - C *A >A ArC T'A C A CC ACTC AG AH C A'ACCT AT XCC G AC A ACC ATT A AO T C AC A A ' neai3l6_ail .seq 
0750, C A- A' A C.T A^A^C.^^A: ^V^^ltfc f A T T A.'CoAV\A C^G/'A T 't A* G T C A € A A aMjaiiks^ 

- • ^&X;C;.^ T^^^fe-T^^^G k^A#i^iA'A%fi;t AfG'^G^. A -A G t T C A G Majority " 

■ J 10^1° . , Vi 10820 .. : 10830 ,10840. .10850 

0794 A C C G G A A T C G A t T T t G A C G G T T A C G G A T A A A A A G A G G A G O A A G T T C A G 2603_;alKseq 

0567 ATT CC 0 G A A TC G AT TTTGA C G CTTAC C.G A T A A A A A GA G C A G G A ACTTCAG 18rs21_aiilseq 

0799. A TT C C C G A A T.C G A T t T T G A C .Q G T T A C G C A T A A A A A G A G C A G G A A G T T C A G coKl_all.seq 

0799 ATTCCGGAATCGATTTTGAC G£T T A C G G AT A A A A A G A G C A C G A A C T T C A G cjblli_aillseq 
0801. AT.TCCG G A A T C G A T T T T G A C gBt.T A C G CAT A A. A A A GAGC A G G A A G T T C A G ne«316^.ail.seq 

0800 A TT C C. C. G A A T C G A T T T T G A C G G T T A C G G A.T AAA A AG A G C A G G A A G TTC AG a909.ail.seq 

C A T T A A pE gEi jj S3 g ^jffifH gfe C C T T A G T A C. C T C C C A C C T T 7T- ^^ ~ ~ 

•_ i2ff2 ^fTO 10880 ,0890 ioloo 

1^7 CATT^r^^ TrCACCAA ^ CTAAAC AAG ^ CTTAC TAGCTCCGACCTTCA 2603 all sea 
10617 CATTHCAlCJTTCACCAAGCTAAACAAGCCTTAGTACCICCrirrTTf 1 7« « -f^ 

CCTTCTCTAAACCCACAACACTAGCCGCAGA T CATCAACTACAAGGAr. AT Majority 
- 10 ?'° 10920 1Q93Q 10940 ic^O 

^CCTTCTCTAAACrrAr^ 2eQ3.ll.seq " 

SI I i 

lOSOO C^t?^^^?^^a^^^^^^ACAC?^^ A ^ ''''^''iCiitcitAT ne^l6 — al l^aeq 

IO9O0 CCTTCTCTAAACGCACAACACTAGCGGCAGATCATCAACTACAAGGAGAT a909_all.seq 

T TC A T T .C CTGTCACCAAAGAGAC C'ACAGTCGGTCGCACAACCCTTACCTT Majority 
10 ^°- '°?70 . 10980 ,0990 ' 1^000 

0949 T TC X tTCf?^rJ?^ A ^ A ^^f CAeTCGCTC " ACAACCCTTAC "T 18rs2I all^eq 

OS50 ITCAIICCTCTCAGCAAACACACGACACTCGGICCGACAA cQ dtl A CCT T a909LaU Iseq 6 '' 
T GA CAACCTTAAACCTCCATTTTA T GA CC-T. TAAACAAacQAAACCCCCCA Majority 
11P1P , "020 11O30 U040 IToSO 

0767 TGACAA'CCTTAAA r'r Trr A t t t A G A C-C T T A A A G .A AACCAAAGCGCCCA .2603 all.seq 
S T ActACC^ AAA ^^ G ^I^I ATGACCTTAAACAAACCAAACCCCC « A ISrsil all^eq 

E?:iis i iff ; ? Hik""""""""""'"!""*!^^ 
s;:i|uss;;iSis:;::in;;i;:iss;uii»2i::j2i:s:ss:i3S^ 

ATCCTTACCTA.CTTCATCCTAAUntAtGT T CTCCTCCTTCAAAATTCA- lhtnr.rv 

> - • - H ? 6Q U 070 11O80 H09O TllOd 

1044 ATGC TTACCTACTTGATCCTAAGACTTATGTTGTCG TCC H CAAAATTr A ran i , 

lu * y A 1 ^ * I A C G T.A .CTTGATCCTAAGACTTA TGTTGTGCTCGTTrA a a a t t n a _-k<7* - « 
OS1 A T C C T T A C G T A C T T C A T C C T A A C A C T T A T G T T G t C G T C 0 11 C i " n^ltTi , ^ 

lOSO ATCC.ITACGTACIT G ATC CT AAG A CTT Alt IT GTCGT.CCTIC A A 1!tT:C A aO^^aH !'seq 
G C A A A A A C C AC A A T TC TCGATCAAC C T A A C T T C A A A C A G C G T G A T T A C C. C. Majority 
• ' . "-f20 U130 U140 iiToO 

G A A A A A CQti C A A T T " CC A T G A A G C T A ACT TCAAAGAGCCTGATTACCC ZfiOT all <^ 
*£ 5* A AA * A.G.6-A-P A AT T C T GOAT G A A € C t A. A CI T C A A A C AG GC'TGATTACC C St IiTIn 
^90 C A AAA AC 6AGAA;T K C>^ A T.C, A. A ,G C t A AC T T C A A A G A G 6 C T G AT T A C C £ conf in ^ 
1099 G C A A A A AC G A C A ATT * * C G A * G A A G C t A A C T T-C A A A GA'GGCTC ATT A r r f ^.U ■ 
ifOl G C -A A A A A C C^A*^^^ a AA C A G G C t'5 ATT a £ > r ^!£HJ^ 

A A T b G C * ' d :Ai ^ A ; :A : ^; fA ; ^ ;ifc ;^; A ;:^ GT tf « * C T CCCTA G. C G T T C C T.A C A A. C G A A Itolorl^ 
UfSti; , ri?70 " 11180 U190 U2O0 •' 

A */f C G^ T G A 1 A A T A ^ ^ ^tJST" G A ? T « C « T * ■ 6 C G T T G C T A C A A'C C A A 26b3_all.seq 

l* : ii7e«CTei?iJ?irr ACcCA.T CT GCAGT.C.CCTAGCGTTCe.TACAACCAA ISrrtUrtTL, 

149 A A ?-l C C t C A ? A A ? A r r « ^ S S ^ *I ° ^ ^ G A G T C 1 C ° T A G C C T T C C T A C A A C C A A cohl-all.seq 
i« A" A t G.G C T G A T A A T AC C A G C C A T & TG CAGT6cItaGGCTTGC I AC A AC G A 1 

150 AAT G G.CT.G A T A A T AC C AG, C AT G T G G A G T .C C C- T A G C C T TG C J A C A A C C A A affi^eT 

CCAUC C f " T A m * I ES Q ^ ****** CTTTTTCAAGC ATTTT6TCT Mam., „ 

"f 10 "?20 11230 II240 ~U2S0 

11194 C C A A A G G C T A A A A A T C C T T T A T T T T A A C C A C T T T T T C A A C C A T T T T r T r I rare .. 
10967 CCA4ACCGTAAAAATCCTTTATTTTAACCACTTTTTCAACCATTTTGTCT 2^7 
1U99 GCAAAGGGTAAAAATCCTTTATTTTAAGCACTTTTT C A AGCATTTTGTCT If if ^ 

11199 GCAAAGGGTAAAAATCCTTTATTTTAAGCACTTT T T CAAGCATTTTGTCT ^ 7 f 

K ccaaaggcta^aM^™^^^ 

11200 GCAAACGGTAAAAATCCTTTATTTTAAGCACTTTTTCAAGCATTTTCTCT aSKK^all.seq 

TTATTCAAAACACTCATTTT A ACATAAAAAA.CCTATTA-AAAAACATATTC Majority 
^ 11260 "270 "280 11290 U300 

!JmT ^? A nr AAAA ^ A ^n ATTT ^ AACATAAAA * ACGTATTAA * AAACAT ATTC 2603 ait.seq 

12to TT ATt£ A A A A r irlr A JII IAA ? ATA AA AAA GOT ATT A A A A A AC AT ATT G ISrsi^ailtL, 

11249 TTATTCAAAAGAGTGATTTTAACATAAAAAAGGTATTAAAAAACATATTG cohl all ^ 
SJ? ?i A II? AAAACAGTGATTTTAA ? AT AAAAAACGTATTAAAAAACA^TTC cj^Il aiHeq 

1 1250 TTATTGA-AAAGArTrATTTT AA ^ A »S 
U2S0 TTATTGAAAAGAGTGATTTTAACATAAAAAAGGTATTAAAAAACATATTG a909_.all.seq 

A C G T-cG ACC CTTTCTTT TCAACTGGCTTGCGTAG A-C A A A A A A A T-A G A T A C fl Major! ty 

11310 "320 11330 U340 " 11350 

11294 ACGTGACCGTTTGTTTTGAAGTGGCTTGCCTACACAAAAAAATAGATACG 2803 mi , m 

11067 ACGTGACCGTTTGTTTTGAAGTGGCTTGCCTACACXAAAAAATACATACr I^T 1,?^ 
11299 ACGTCACCCTTTCTTTTGAAGTGGCTTCCGTAGACAAAAAAATAGATACC ^ 

11299 ACGTGACCGTTTG. TTTTGAAGTGGCTTGCGTAGACAA'AAAAATAGATACG ^m'aiHL 

U301 A C G T G A C C G T T T G T T TT G A A G T C G C T T G C C T A G A C A A A A A A A TA G A TA C G he^3 e"a "2 

U300 A. CGTCACCGTTTGTTTTGAAGTGGCTTGCGTAGACAAAAAAATAGAT'ACG ^TOSLaTl Iseq^ 

TCAGAT. AAATT'TCTGGCAT TACCAGAACATTTTTAGAC.TCTTCTCTTTTT Majority 
11360 H370 11380 11390 "7l400 



I 1344 T-CAGATAAATTT'GTGGCATTACGAGAACAT. TTTTACAGTGTTCTCTTTTT 2603 all sen 
Jill TCACATAAATTICrCCCAirACCACAACATITIIACAeiCT T " 1IS2! alHeq 
J C A G A T A A A T T T G T G G C A TT A C G A G A A C A T T TTT A C AGTGTTCTCTTTTT coh! all "eo 
TCAGA TAAATTTCTGGCATTACGAGAACATTTBTACAGTGTTCTCTTTTT ctb Tl aiHeo 
J 3 ! 1 TCAGAT A. AATTTCTGGCATTACGAGAAC.ATTTT TAG AGTGTTCTCTT-TTT ^6 1^ 
11350 T.C A G A TA A A T T T C T G G C A TT.A C C A G A A C.A T T tQt A G AGTCTTCTCTTTTT a909_a7l"eq 

TTAGTTTACGGA'GGAAAAA T ATATATG-GAAAAACACGATT'CACGACTTCT Majority 
U f 10 . 11430 11440 iTIso 

\f* TTAGTTTACGCACGAAAAATATATATGCAAAAACACCATTCACCACTTCT 2603 all _eq 

1167 T.TAGTTTACGGAGGAAAAA.TATATATCGAAAAACAGCATTCACGAGTTCT 18rs21 aiHeo 

J 39 , 9 ITAGTTTACGGACGAAAAATATATATGGAAAAACAGGATTCACGAGTTCT ^oM all ^eo 

1399 TTACTTTACGGAGGAAAAATATATATCGA.AAAACAGCAtTCACGAGTTCT it aiHeo 

1400 T T A G T T T A C G G A G G A A A A A T A T A T A T C G A A A A A C A CG A.'T T C A C G A G T T C T S's^ 
C A T C C A T T C C C A G C C G A A T T C T G G G G A C A A G C T C A T T C A A C A C C A A A C C A Majority 

11460 '"TO . 11480 11490 . USOO; 

1444 c A T.CC A T T C G G A G G G G A A TT C t G G G CvA C A A G C T.C A T TG A A C A C C A A A C C A ZfifV. =ii 

\t%-$tZ t *'f ll%^t kGGGQhK "*Z C T •«•« ** A C A G T C A T T G A A C A C C.A A. A CCA -o^if^T 1 
55!? '£ A • '5 ^ ' G .« A G G » G A. A T.I CTGG.GC A C A A C C T C A T T C A A C A C G A A A C C A cjolTl ai^eo 

1450 C A T C ,C A T T C C -G^A-'G G^G C A A T: T^'C-'T^G G GwG A.C A- A C-C'T G A IT G^CkWa l^ ^i^' 
G C C C AACC GG GTC fr.T AC T A CCA A CT C C * T C G ,T A G C TTT A C JT C A A CC A ."a A A l^lorlrir ' 

: U ?'° . M? 20 ■ . 11530 . 11S40 11550 • 

OC c c : A AC.C G C CI C G T A C T A C C A A G T C G A T C G T A G C TT T A G T C A A C C A AAA 2603 ail seq 
"S £ C A A » G G G C T G G T A C T * C » A A G T C G.A T C G T A G C T T T A G T C A A C C A A A A aiHeq 
1499 G C G C AAUG GG G TCGT-ACTACCAAGTC CATC G T A G C T T T A G T C A A C C A A A A coS 

1499 GCGCAAC.GGGGTCCTACTACCAACTCCAT C G T A G C TTT A CTC A A CCA A A Sb 111 ait^ea 
1501 G C C C.A A CGCGGTGGTACTACCAAGTCGATCGTACCTTTAGTCAACCAAAA nS'lil ^ 

1500 GCGCAACGGCGTGCTACTA.CCAAGTCGATCGTAGCTTTAGTciACCAAAA a9b9 all 

CUCUC P* ** C F *^ I M __f ^1 8 5 35 3 T A C A A A C T C T C C C T A A T C A T _■ w. ^ 

"f"> «1S80 U590 ,,600 

TCCATTTCCGGCGC TCAT CATCTCATCGCTTATCCTTATAGCAAATGTAC Majority 
, _____ «»£0 "630 U64b . „6SO 

11367 rccnn» C CCCOC^:?r r^^l""""" 1 " 1 """"" 2 603_al. .sea 
list,/ ICGAlTTCGG-GCGGTGATGA.TGTCATGGGTTATGCTTATA-GCAAATrTAr' n„^i ,7^ 

11599 TCGATTTCGG-GCGGTGATGATGTCATGGGTTATGCTTATAGCAAATCTAC ,~, ^ 
TCGATTT.C G.G GCGGTGATCATGTCATGCCTTATGCTTA TA GCAAATGTAC ^ 7? if 

UGOl TCCiTIUCeCCCCTCAICATCTCATCCCITAT c 1 £___ fi" l,*™* 

11600 TCGATTTCCCGCGGTCATGATGTCATGCGTTATGCTTATAGCAAATGTAC a^9^.aTi!seq S< ' 

TTCGCCACTTGCCGCA CCAA TTAATCAGTCGGACTGAAACTOAAACCTTG Majority 

11660 iigj UfSO U690 U700 

1644 TTC.CCCAGTTGCCGCACCAATTAATCAGTGCCACTCA AACTCAAArrTTr 7Bm , . 
1417 TT.GGGGAGTTGCGGCA'CCAATTAATCAGTGGGACTGAAACTCAAAG G'T T G SS? i.^ 

1649 ttggggagttgcggcacgaattaatcagtgggXctgaaactcaaacgttg 2Ef » n ^ 

1649 TTGGCGAGTTGCGGCACCAATTAATCAGTGCGACTGAAACTCAAACG^G r^ m ifl 

1650 TTGGGGAGTTGCGGCACC^ATT^AT^^^ G G A C'T- GAA.ACTCAAAGGTTG ne_3l6_ail!seq 
1650 TTGGGGAGT TO C G G C A C G A. A TTAATCAGTCGGACTGAAACTCAAACGTTG a909_.a7l.seq 

AAATGCTGACAAGATTACCATTACCAGT TCAATGCCAAATGCTCAGCATT Majority 

U ?'° "J 20 , "?30 H740 "1^1750 

*^ 94 AAATCCTCAGAAGATTACCATTACCACtTCAATC CCAAATGCTrArrATT \>*nr, ,, 
1467 A A AT C G I C A G A A C A T I A C.C A T T A C C A G T T C A A I G C C A A A T C ft I C i C _ ill 
1699 AAATGGTCACAAGATTACCAT T*A CCAGTTCAATGGGAAATGGTCA*GATT ™hf 7? 

1699 AAATGGTGAGAAGATTACCATTACCAGTTC'AATGGCAAATGGTCAGGATT Tttft?. i 
1701 A'AAICGTCACAlCATTACCATIACCACtlCAAICCClAlTCOTCACCATI 

1700 AAATCCtOAUACAITACCATTACCACITCAATG"!^"..?.:..:^^ 

CCCTTCGAACACCCGAAAGACTA.CATCCTCA AACTCATACACTTCCAAAA Majority 
U T 60 "770 "780 H790 11800 

fs_7 GGGTTCGAACAGCCr A' A A 5^S!E A )r A I?~ TGAAACTCATA ^ AGTTCC AAAA 2603_aU.se,, 
749 GGG?^GG A Ar A rrrr A Mr A r^ A ^ A I^ 18rs21_aU^ea 

^cccTTcca^c.cc^ 

17SOGGGTTGGAACAGCCGAAACACTAGA.TGCTGAAACTCATACAGTTCC.AAAA a909_all. S _q 

,C A AC C T A C T A.T TCTCTCTTTTTAGG A-A A C TA G TTATCCTTCCTATATAGr. Majority 

"810 H82Q . 11830 11840 n.go 

?i94 G A A C C TACTATTCTCTCTTTT.TAGGAAAGTAGTTAT.GGTTCGTATA T A C C ?Jtm a it 

.567 C A A G G I.AC TA TIC T C T C IT T'TTAGGAAA-GTA G TT A T G CTTC'CTAf At AG C S "ifL, 

.799 C A A C C I.AC T A IT CI CTCTI TT T A G C AAA G T AG T T AT C t'TICC T'A TA TAG G i__f _j? ^ 

799 GAA G ST ACTAI.TCTCTC T TT TT A GO AAA CI A CI I At G CI I C G T ATA T AG G t^lll liTtn 
801 G A A G.G T A C T A TT C T C T CT T T T.T A CCA A AG IAGII ATCG TT C ft TAT AT A G G S ^ 

800 C A A ft CI A C I ATT C Tft T C TT T'T'T A CCA A-A GTAGTT . All T 1 CO a^ alf seT 



CTACGGAXCTAT ATCTTT C CT CACATTA CATCTACACATACTACCATC A A - Majority 
11860 ; U870 . 11880 . 11890 ' Ugob 



— — — i 1 : I _j — - __>w 

844 CTACGGAAC.TATATCT TTCGICACATIAC At CI ACAf. 1T1 f Ti r r itt'i I •_»_ 

617 CTA-CCGAACTAT A-T CTTTCCTCACATTACAT CT ACACATAGTACCATcXa tT f t 1^2 

849 C T AC C G A AC T A T A T C T T I C C I C A C'A T*T ACA'TCTACAGATAGTACCATCAA I ti 
SlSCtACCCAACIAIATCTIICGICACATIACATCTA UAncn CCA T G A A _7_ 7? .f* 1 

850 CTACG'GAACTATATr t t ^''^^'^''^^'^''MATAGTACCATCAA n_o3{_~_H "seq 
850 C T. ACGGAACT- ATA -TCTTTCGT CACATTA CATCTACACATAGTACCATCAA a909_a7l._e^ 
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T T T T G C TW #T A B T-fc X ; "fe ''C^ A f "jh 



1 19 10 



11920 



Jhl G f"C AC G A T A T A C T A ACAAATGAAATAT 
1 1930 I 1940 



Majority 



11894 
11667 
11899 
11899 
11901 
11900 



11944 
11717 
11949 
11949 
11951 
U950 



I19SO 



TTTTGCTTATATGACCAAGTAAAGTGAGGATATACTAACAAATCAAATAT 2603 ail seq 
^^^n^f TGACCAAGTAAAGTGAGCATATACTAACAAATGAAAT AT ISrsil ail!seq 
TTTTGCTTATATGACCAACTAAAGTGAGGATATACTAACAAATGAAATATeohlaTl seq 
TTTTCCTTATATGACCAAGTAAAGTC-AGCATATACTAACAAATCAAATAT cjbill aiHeq 
TTTTGCTTATATGACCAAGTAAAGTGAGGATATACT A ACAAATGAAATAT nem316~ai Iseq 
TTATATGACCAAGTAAAGTGACGATATACTAACAAATGAAAT 



T T T T C C 



T a909_ail.seq 



TTATTATCGTATTTGTCCATTTTATCGA 
11960 11970 

JL 



AAAGTTTCCATATTATCATTAT Majority 
11980 11990 12000 



T T AT TATCGTATTTGTCCATTTTATCGAA 
TTATTATCGTATTTCTCCATTTTATCGAA 
TTATTATCGTATTTCTCCATTTTATCGAA 
TTATTATCGTATTTCTCCATTTTATCGAA 
TTATTATCGTATTTCTCCATTTTATCGAA 
TTATTATCGTATTTCTCCATTTTATCGAA 



AAGTTTGCATATTATCATTAT 
AAGTTTGCATATTATCATTAT 
AAGTTTGCATATTATCATTAT 
AAGTTTGCATATTATCATTAT 
AAGTTTGCATATTATCATTAT ne*316_ai 1 .secj 
AAGTTTGCATATTATCATTAT a909_aii.seq 



2603_ail.seq 
18rs21_ail.seq 
cohl_ail .seq 
cjblll__ail.seq 



CTTTCATAA.GATGCAAATATA ATGATACTAGGAGCTAAATATG GATATTT Majority 



12010 



12020 



11994 
11767 
L1999 
11999 
12001 
12000 



12044 
11817 
12049 
12049 
12051 
12050 



12030 
_i_ 



12040 



12O50 



GTTTG AT A AG AT CCAAATATAATGATAG TACGAGCTAAATATCGATATTT 2603 ail seq 
GTTTGATAAGATGCAAATATAATGATAGTAGGAGCTAAATATGGATATTT ISrsil alUsea 
GTTTGATAAGATGCAAATATAATGATAGTAGGAGCTAAATATGGATATTT cohl ail. seq 
GTTTGATAAGATGCAAATA.TAATGATACTAGGAGCTAAAT.ATCGATATTT cjbill ail!seq 
GTTTG-ATAA GA.TGC AAA TATAATCATACTAGGK CTA AATATG GATATTT ne«^16~all.seq 
GTTTGATAAGATGCAAATATAATGATAGTAGGAGCTAAATATGGATATT. T a909 ail. seq 



AAAAAATCAA GAGTATCCTAAGTGCTTTCC. ATTTTCAAATTCA 

12060 " 12070 12080 

J. 



12090 



A A T AT A G Majority 
12100 



AAAAAATCA AGAGT A T C CTAACTCCTTTCCATTTTGAAATTCA AATATAG 2603_ait,seq 
AAAAAATCAAGAGTATCCTAAGTGCTTTCCATTTT G-A AATTCAAATA TA G 18rs21_ai Lseq 
AAAAAATCAAGAGTATC.C T-A AGTGCTTTCC AT TTTGAAATTCAAA TA TAG Gohl^ail.seq 
A A A A A A T C A A G A G T A T C C T A A G T G C T T T CC A TT t T G A A A T T C A A A t A T A G cjblll_all seq 
AAAAAATCAAGAGTATCCTAAGTGCTTTCCATTTTGAAATTCAAATATAG n«a316_ail seq 
A A A A A A T C -A A G A G T A T C C T A A G T G C T T T C C A T T T T G A A A T T-'C A A AT A T A G 



CTAATACTTCTAGAACTTCTAATTGTTTT TCGTC'G A CG 

" " ~" -TP 

12130 



121 lO 



.2094 
.1867 
.2099 
.2099 
2101 
2100 



12120 



a909_aii.seq 

ATATC A ATTTTC Majority 

12140 12150 
j_ 



CT.AATAGTTCTAGAACTTCTAATTGTTTTTCGTCGACGATATGAATTTTC 2603_ail.seq 
CTAATAGTTC.TAGAACTTCTAATTGTTTTTCGTCGACGATATGAATTTTC 18rs21_ail seq 
CTAATAGTTCTAGAACTTCTAATTGTTTTTCGTCGACGATATGAATTTTC cohl^ail seq 
CTAATAGTTCTAGAACTTCTAATTGTTTTTCGTCGACGATATGAATTTTC cjbill ai]Useq 
CTAATACTTCTAGAACTTCTAATTCTTTTTCGTCGACGATATGAATTTTC 
CTAATAGTTCTAGAACTTCTAATTCTT.TTTCGTCGACCATATGAATTTTC 



nem316_ai 1 .seq 
a909_all.seq 



AATCTTAACTCT TACC.ATTCCACCTCCCTTTCCT TA A AG AA A A A A CGTC A Majority 



12160 



12170 



12180 



12190 



12200 



2144. A A T.C T T A A C T G T T A G G A T T C C A C C T C C C.T T f G G T T A A A G A A A A A A G G T C A 
1917 A A T C T T A A C;T G T T AG G A T T C C A C C T C CC TTTG G T TA AA G AAA AAA GOT 



26Q3_aii.seq 
I8rs21_ai f .seq 




CGT'CCTTT AC AT A AC.TTTCTC.AAACA AGCTCA AG C TATCTAA AAATACTT Maloritv 

12250 



12210 
— i— 



12220 



12230 



.12240 



2194 G GT CGTTTAGATA A CTTTGTC A AACAAGCTC A AC CT ATCTA AAAATAGTT 2603 ailise^ 



1967 
2199 
2199 
2201 
2200 



C GT C G T T T A G A T A A C T T T G T C A A A C A A G C T C A A G C T A T C T A A A A A T A G T T 18rs21 ail^eq 
G GTC GTTTA GAT" A AC TT TGTCA AACAAG'CTCA AG C'T ATCTA AAAATAGTT cohl^ai 1 ,seq * 
GGTCGTTTAGAT. AACTTTGTCAAACAAGCTCAAGCTAT C.T AAAAATAGTT cjbill_ail7seq 
G GTC GTTTAGATAACTTTGTCAAACAACCTCAAGCTATCTAAAAATAGT T nem316 ail.seq 
CC T.C CTTTAGATAACTTTGTCAAACAAGCTCAAGCT ATCTA AAAATAGTT a909_a7l.seq 

T G A A A T c ff S ^2SS@SSg2 IB E|wi AGCTATCTGATCACCA GAAGC Majority _ 

12260 12270 12280 12290 12300

12244 TGAAATGCGCATTACTCTACTTTTTAATAAGCTATCTGATGAGCAGAAGG 2603_ai1.seq
12017 TGAAATGGGCATTACTCTAGTTTTTAATAAGCTATCTGATGAGCAGAAGG 18rs21_ai1.seq
12249 TGAAATGGGCATTACTCTAGTTTTTAATAAGCTATCTGATGACCACAAGG coh1_ai1.seq
12249 TGAAATCGGCATTACTCTAGTTTTTAATAAGCTATCTCATGAGCAGAAGG cjb111_ai1.seq
12251 TGAAATGGGCATTACTCTAGTTTTTAATAAGCTATCTGATGAGCACAAGG nem316_ai1.seq

12250 TGAAATGGGCATTACTCTAGTTTTTAATAAGCTATCTCATCACCAC-AA.ee a909_aTl.seq 

ACAAGTTAATGCATGTTGGGAAGTCTTATTTTGACTATCAAGAAAATGCT Majority
12310 12320 12330 12340 12350

12294 ACAAGTTAATCCATCTTGGCAAGTCTTATTTTGACTATCAAGAAAATGCT 2603_ai1.seq
12067 AGAACTTAATCCATCTTGGGAACTCTTATTTTCACTATCAACAAAATGCT 18rs21_ai1.seq
12299 AGAAGTTAATGCATGTTGGGAAGTCTTATTTTGACTATCAAGAAAATGCT coh1_ai1.seq
12299 AGAAGTTAATGCATGTTGGGAAGTCTTATTTTGACTATCAAGAAAATGCT cjb111_ai1.seq
12301 AGAAGTTAATGCATGTTGGGAAGTCTTATTTTGACTATCAAGAAAATGCT nem316_ai1.seq
12300 AGAAGTTAATGCATGTTGGGAACTCTTATTTTGACTATCAAGAAAATGCT a909_ai1.seq

CTTATCCCACAATTAGGTTTTCTATATTCTAAATTAACTAAAAAAATTGA Majority
12360 12370 12380 12390 12400

12344 CTTATCCCACAATTACGTTTTCTATATTCTAAATTAACTAAAAAAATTGA 2603_ai1.seq
12117 CTTATCCCACAATTAGGTTTTCTATATTCTAAATTAACTAAAAAAATTGA 18rs21_ai1.seq
12349 CTTATCCCACAATTACGTTTTCTATATTCTAAATTAACTAAAAAAATTGA coh1_ai1.seq
12349 CTTATCCCACAATTACGTTTTCTATATTCTAAATTAACTAAAAAAATTGA cjb111_ai1.seq
12351 CTTATCCCACAATTAGGTTTTCTATATTCTAAATTAACTAAAAAAATTGA nem316_ai1.seq
12350 CTTATCCCACAATTAGGTTTTCTATATTCTAAATTAACTAAAAAAATTGA a909_ai1.seq

ACTTGA TAATCCCTTGTCTCCGACTGAACAAAAGTTATTGATTACCTTAT Majority 

12410 12420 12430 12440 12450 

2394 ACTTGATA-ATCGGTTGTC T'.C C G A C T G A ADAA AAGTTATTC ATTACCTTAT 2603_ail seq 

2167 A C T T G A T A A T C G G T T G T C T C C G A C T G A.A C A A A A G T T A T T G A T T A C C T T A T 18rs21_ail.seq 

2399 ACTTCATAATC G G TTGTCTCCGACTCAACAAAACTTATTG ATT ACCTTAT cohl_ail.seq 

2399 ACTTGATAAT C G GTTGTCTCCGACTG A AC AA AAGTTATTC ATTACCTTAT clblll_ailseq 
2401 ACT T G AT.AATC G GT TGT. CTCCGAC TGAA C A A A A G T T A T T G A T TACCTTAT nem316_all *seq 

2400 A CTTGA TAATCGGTTGT C T CCG-ACT G A A C A. A AAGTTATTGATT AC C T TAT a909_ail,seq 

TATTACATACTAAAGGTTTAATCATTGATATCTAAGAAGTAAGTCAGCTA Majority
12460 12470 12480 12490 12500

12444 TATTACATACTAAAGCTTTAATCATTGATATGTAAGAACTAAGTCAGCTA 2603_ai1.seq
12217 TATTACATACTAAAGGTTTAATCATTGATATGTAAGAAGTAAGTCAGCTA 18rs21_ai1.seq
12449 TATTACATACTAAAGGTTTAATCATTGATATGTAACAAGTAAGTCAGCTA coh1_ai1.seq
12449 TATTACATACTAAAGGTTTAATCATTGATATGTAACAAGTAAGTCAGCTA cjb111_ai1.seq
12451 TATTACATACTAAAGGTTTAATCATTCATATGTAAGAAGTAAGTCAGCTA nem316_ai1.seq
12450 TATTACATACTAAAGGTTTAATCATTGATATGTAAGAAGTAAGTCAGCTA a909_ai1.seq



A C C G A T C TT* T CT ATTCT AAAACTT ATATTGTTCCTTTAGA AATTTTA AAG Majority 

1,2510 12520 . 125 30 12540 12550 

2494 AC C G A f~C TT TCTATTCTAA AA CTTATAT T G TTCCT.TT ACAA ATTTTA AAG 2603 ail. seq 

2J&7 A CCC.A TCTTTCT AT T -C.T :A AAA C T T A T A T T G T T. G C T T T AG AA ATTTTA AAG 18rs2t_ai i . seq 

£499 A.C C G A,T OtTTCTA T.TC T AAA AG. T T ATA t T G T TG CTTT A GAA A T T'T t" A A A C obhl_ail seq" 

£499 ACC-G ATCTTTCTATTCTAAAA CTTA TAT T G T T G C T T T A G A A A T T T T A A A G cjbill_ail.seq 

2501 .A C.C.G.A TCTTTCT AT TCT A A A, A G T T A TAT T GTTGGT T T A C A A A T t T T A.A AG ne*316_ail seq 

E500 ACCGATCTTTCTATTC T A AAA C.t T A T ATT G T T G C T T T A G A A A T T tT A A AG a909_ail.seq 

AGCGTGCATGGCTTCATAATAAACAGAAATCTTACCAATTTGCGAAGCCA Majority
12560 12570 12580 12590 12600

12544 AGCGTGGATGCCTTCATAATAAACAGAAATCTTACCAATTTGCGAAGCCA 2603_ai1.seq
12317 AGCGTGGATGGCTTCATAATAAACAGAAATCTTACCAATTTGCGAAGCCA 18rs21_ai1.seq
12549 AGCGTCCATGGCTTCATAATAAACACAAATCTTACCAATTTGCGAAGCCA coh1_ai1.seq
12549 AGCGTGGATGGCTTCATAATAAACAGAAATCTTACCAATTTGCGAAGCCA cjb111_ai1.seq
12551 AGCGTGGATGGCTTCATAATAAACAGAAATCTTACCAATTTGCGAAGCCA nem316_ai1.seq

*550 AGCGTGGArCGCTTCATAATAAACAGAAATCTTACCAATTTCCGAAGCCA a909_aiKseq . 
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TCTAATACATACTCCACTTAC Majority 
. 12 f 10 '2620 1263Q 12640 126S0 

iA«(liT<TriTiTrTfi'' A G ATCTAATACATACTCCACTT AC 2603_all.seq 

2599 AAAAATATGATATTTGAAGAG, TCCAAACATCTAATACATAGTCCACTTA" 

AGTCCAGTTA 
AGTCCAGTTA 

C aS09_all.seq 



2599 



« < « . . i ? 4- ^ ? i i» a a I* A _■ I t;t-AAAGATCTAATAGATAGTCCAGTTAG cohl all.seq 

AA.AAATATGATATTTGAAGACTCCAAAGATCTAATAGA T.A GTCCAGTTAG clblll alHeo 
2601 AAAAATATGATATTTGAAGACTCCAAAGATCTAATACATAGTCCAGTTAC nem3I 6 "a 
2600 AAAA-ATAT'GATATTTGAAGAGTCCAAACATCTAATAGATACT-CCAGTTAG al^laTl"^ 

ACAAGCCTTCATTATAACTGA T AACCATTTTCAAAAATT A A AACAACAGC Majority 
12G6 ° »2670 12680 12690 12700 ■ 

zt?7 ac^a G ^ G ^I5 A II A I^^ G ^ ATAAGGA ^ T ^^ G AAAAATTAAAACAAG'AGC 2603_all.seq 
^ A J AA ^^^ A JI TAAGT6ATA4GCATTTTCAAAAATTA *»*<;* A <:AGC 18rs21 aiHeq 

S r II I AACICAIAACCATTITCAAAA * TTAAAACA *<: a ':c«1>1 aU.seT 1 

«^^t»^^^ A I^ ATAAGTCATAAGGATTTTCAAAAATTAAA *< : AAGACC cjblll ail^Lq 

^o ACA t G r G n GA n ATAACTGATAACGATTTTCAAAAATTAAAA CAACAGC ne_316~all._^q 

2650 A G A A G C G II GATTATAAGTGATAAGC AT TTTCAAAAATTAAAACAACAGC a909._a7l.seq 

TATTATTTTAACCGACTTATT TTAAAGACTTATCATATCTACCCTTGCTT Malorlr^ 
t2 T 10 t2 P> 12730 12740 127S0 

£ 6 7 ?A??A??^ AACCCAC ^^ A i: TT ^ AAAGAC ^ TATCATATC ^ AC€CTTCCT T 2603_aU. Seq 

I A II ATTTTAACCGACTTATTTTAAAGACTTATCA 1'ATCTACCCTTGCTT 18rs21 ailiea 
UIIAITITAACCGACinTITTAA_CACITAKATATCTA.CCIH.il cohtlaTl seq 



2699 
2699 
2701 



I.II A IIII AACCCACTTATTTTAAACACTTAT ' CATATCTA e««' , TCCTT cjblll all.seq 
^J A J^ ATTTTA A C C GACTTATTTTAAA GACTTATCATATCTAGGC.TTGCTT ne_3l6~aU _e2 
2700TATTATTTTAACCGACTTAT.TTTAAAGACTTATCATATCTACCCTTGCTT alo9_.aU "eq^ 

CATCATTCCGAAAAATACGGACACTATA CTATTTCAACCAAAACATACAA Majority 
. ; lZ7 . e ° 12770 12780 12790 12800 

*** tilt^ll^ f^y i A ^l^^^ GACi ^ TAC ^^^ CJi AC GkK A AOAT AC A'A 2603_ail.seq 
S_a GA i G ^I^ CGGAAAAATACGGAGACTATACTATTTCAACGAAAACA TACAA 18rs2I alltseq 

?49 CATCATTrrrAl..iTirpr A r' A '^ A ^ A ^^ A ^ T ^ GAA ' GA cohl_a7t.seq^ 

2749 CATCATTCGGAAA.AATACGGAGACTATACTATTTCA'AGGAAAACATACAA ciblll ail sea 

^ r_Tf«TTrrr AAAAA ^ A G G C A G A-.C TATACTATT TC A' A G'G A A- A A G AT A C A A ne_316~ail.seq 

27S0CATCATTCGGAAAAATACGGAGACTATACTATTTCAACGAAAACATACAA a909_a7!.seq 

AACTTTCCAA.TCA Act CTTC AACIATACAICCTICA'AACTCATCCCCTAC Majority 
■ 12 ° 10 IZSZO 12830 . 12840 12850 

^TTIT^nrj.^^ all.seq 
S_I AAG III CGAATCAAGTCTTCAACTATACATCCTTCAAAG TCATCGGCTAG 18rs5l all!.eq 
^_ AAG III CGAATCA AGTCTTCAACTATACATCCTTCAAACTCATCCGCTACcohT aU^T 
» J I T iUICTIClACTATACAICCTICA1ACICAT ««TAC cJbUi a Ueq 

^1 AG III CGAATCA AGTCTTCAACTATACATCCTTCAAAGTCATCGGCTAC ne_3i6 _ all. S .. 
MOOAAGTTTCGAATCAACTCTTCAA.CTATACATCCTTCAAAGTCATCCGCTAG a909__7l.seq 

ACATTTCGAATTATCAACCAA T.CCCTTTGATTACTAGAAAAATAAATACC Majority 
- . .. . gggg . ■ 12870 12880 12890 . 12900 

IV, _1 M II G G A A iff! G A A C C A A T C C C T T -T 6 A T T A C T A G A A A A A T A A A T.A G C 2603 all.seq. 
«lo A r a t I 3 S> . t I I i 15 A A C C A A T C C C T T- T. G A T T A C T A G A A A A A T. A A A t A G C lorsil alltL. 
AG MI I 5 G A A TT • •• A 'A CCA AT C C C T.T T G A T .T A C T A G A A A A A T A A A T A C C cohl all.seq 

* AGA T II GGA MIM?M^ 

A £ * I % I G -« *•* J ! A G A A G G A A T;C C C T T T G A T T A G T A G A A A A A T A A A T A G C ne_3l6 ali.sS 
A C A T T. T G G A A T T A T G A A C C A A T C C C T T T G A T T A C T A G A A A A A T A A A T A C C a909 all.seq ' 



asi 

!850 



T T G G A C A G G C T A A C G T C A C A C T C G T T G A T C C A A T C T C C C T T T ATT T A A C A Majority 
12910 12920 12930' 12940 12950 



«94 TT C C A C A C G G T A A. C G T G A C A C T G G.T T G A T C C A A T C T C G C T T T A T T T A A C A 2603 all.seq 

» S _^^ GGAGAGG ^I^ ACGTGACACTGGTTGATCCAATCTCGCTTTATTTA ACA 18rs2I_all7_eq 

^•^^ GGAGAGGC I^ ACGTGACACTGGTTGATCCAATCTCGCTT T A TTTAAC.A oohl_all.seq^ 

!899 TTC.GA. GAGGCTAACGTGACACTGGTTGATCCAATCTCGCTTT.ATTTAACA ciblll allseo 

TTG C A CAGG r T AAC ^^ 

TT CCA C A C GCT A AC GT GACACTGGTT G-A TCC. AATCTCGCTTTATTTAACA _909_all.seq 
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CT A AC A A»f'frAT >; * A"'g9' fefe C gjgj f Wa^A C A A G A A C T T G A C C A C C T A C A A Majority 



12960 12970 12980 12990 



13000 



12944 CTAACAATGATGAAGACCCTCGTATTCAAGAAGAAGTTGAGCAGCTAGAA 2603 ail seq 
12717 CTAAGAATGATGAAGACCCTCGTATTGAAGAAGAAGTTGAGCACCTAGAA 18rs21 ail seq 
\^tl ^AACAATGATCAAGA.CCCTCGTATTGAAGAAGAAGTTCAGCAGCTAGAAa,hl all. seq 

CTAAGAA.TGATGAAGACCCTCGTATTGAACAAGAACTTGAGCACCTACAA cjblTl ail seq 

S« taagaatgatgaagaccctcg tattgaagaagaagttgagcagctagaH nLa^au!^ 

12950 CTAACAATCATGAACACCCTCCTATTCAACAACAAGTT-GAGCACCTAGAA 3909_.a7l.seq 
G A T A A G A T 



12994 C AT A AG A T 2603 all sea 

12767 GATAAGAT 18rsi! "iHea 

12999 GATAAGAT ^f^i 'J, 

12999 G A t Tf i f 
13000HA cjblll_ail.seq 
Si T A A T A T nea.316_ail.seq 

13000 GATAAGAT a909_ail.seq 

Decoration 'Decoration #1': Shade (with solid black) residues that differ from the Consensus.
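
As a rough illustration of the decoration rule stated above, the following Python sketch derives a per-column majority row from a set of aligned sequences and marks every residue that differs from it. The function names, the `#` marker standing in for solid-black shading, and the toy sequences are all hypothetical and are not taken from the alignment report. How the alignment software itself breaks ties between equally frequent residues is not stated in the source, so the tie-breaking here (first residue seen in the column) is an assumption.

```python
from collections import Counter


def majority_row(rows):
    """Return the most common residue in each column of an alignment."""
    width = len(rows[0])
    if any(len(r) != width for r in rows):
        raise ValueError("all aligned rows must have the same length")
    # most_common(1) returns the first-seen residue when counts are tied
    # (an assumption; the source does not state a tie-breaking rule).
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*rows))


def shade_differences(rows, consensus, mark="#"):
    """Replace each residue that differs from the consensus with `mark`."""
    return [
        "".join(res if res == ref else mark for res, ref in zip(row, consensus))
        for row in rows
    ]


# Toy aligned sequences (hypothetical, for illustration only).
toy_rows = ["ACGT", "ACGA", "TCGT"]
toy_majority = majority_row(toy_rows)
toy_shaded = shade_differences(toy_rows, toy_majority)
```

With the toy input, the majority row is `ACGT`, and the residues flagged are the final `A` of the second row and the initial `T` of the third.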

Alignment Report, Weighted residue weight table. WO 2006/078318 PCT/US2005/027239

W ffiS j^j¥¥l^ '^""CTCTIUTAtCCT, ■.ajor.ty " 

52 2 ,° 30 40 SO 

C G CCT TGTTCCCATGTTGAT||;CCCATAACTCCT CCCT<:ATTAATAGCCTr ?fini 
GGC^TT^TTCCCATCTT^T?^^ 

UCCTTOirCCCAICTTCArCCCMIAUTCCT.C^CnUUUCCuS:^ 
TTCGTAACGCTCTTT AA TTATCTCTAACTTAGCATCCGTATTGGTAAAAT Majority 

T ___ 3°1 «°° 

I" GT AACGCTCTTTAATTATCTCTAACTTAGCATGCCTATTG GTAAAAT 2603 a!2 ««, 

TTCGTAACGCTCTTTAATTATCTCTAACTTACCATCGGTATTGGTAAAA? St f ifL, 

TTCGTAACGrTrTTTA^^^ A ^ G ^ G ^ AAG ^ T ^ GG *^ GGG ^'^ ' 5^aI2?seq^^ 

TTCGTAACGCTCTTTAATTATCTCTAA C'T T A G'C ATGGGTATTGGTAAAAT olh7l I _7?L_ 

TICCTAACOCTCTITAATIATCTCTAACrTACCATGUnnccnulTS:^ 
TTTGAAAATAGACTAACTATTTATTAACC TCACCCCACTTTCTATCCATG Ma.„ rlf „ 

. l \° 120 «y uo ~^o 

TT T G AAAATAGACTAAGTATTTATTAACCTCAGGCCACTTTCTATGCATr' Jm «_, 

^?rJ A w? A r A ^ A ^ TATTTATTAA CB,T CAGGG CACTTTCTATGCATCSIS al2.seq 
TTTCAAAATACACTAACTATTIATtAACCTCACCCCACTTTCTATCCATC clhTl I „l?<= M 

tttgaaaatagactaagtatttattaacctcaggccactt?c?a?gca?g ^"l,^ 

AAATCAATTTCTTTATAGAATT G T T C A C G A, A T A G G A G C T T C T C C A G C * A r. Majority 
160 JJO 180 190 zoo 

^aT-A*???.??? A ? A ^ AA "^I CACCAATACGAGCT TCTGGAGCAAC 2603_al 2 .se<, 

A A t £ C A A"T~T ^r?^ A ^ A ^ AA I^n C f CGAATAGCAGCTTCTCGAGCAAC 18rs21_alHeq 

AAA?C^^?r?^ A J AGAA IJ GTTCACGAATAGGAGCTTCTCGAGCAAC 51S_al2. S eq 

AAA I CAA I TTCTTTATAGAATTGTTCAGGAA TAGGAGCTTCTCGACCAAC cibiu al2sea 

AAATCAATTTCTTTATAGAATTGTTCACGAATAGCAGCTTCTCCAGCAAC ^"al^feT 

MTACCAtCCCCTCAACCA C AAAC.TGTCCUAAACTCCACCCtCCTC.TAC Majority 
210 220 230 240 250

?A?AfcCA?CCCclc A lcr A ^ A M 2603_al2.se., 
TXT^CC^T^rrr^rA^rrArM^ 0 * 00 ^^ IBr^l^l-^seq 
tX™ GC _T CC . CC ? G » ACCAGAAAC ^ G J GCAA ^^ SlS_al2.se, 
T^AG^XT^rr^rAArr^f^™™ 0 ^^^^ cjblll_al?_eq 
TATAGCATCCCCTGAACCAGAAACTG.TCCAAAAAGTGCACCCTCCTCTAG h36b_al2.seq - 

CAACTCTTCCCTCTCTGTTAGGACAGTCA AAACCACCATCTATACCTAAT Majority 
_°. f™ , 280 290 "ISo 

fAirTrTT^r^ CTCT ^ I ' '* T ^ T * T ' cc T'*T 2603 a!2 sea 

r AAC J G ^ CCGTCTCTGTTAGGACAGTCAAAACCAGCATG TATAGGTAAT ^iLalHeo 
C A_^ G ^ CG _i TCTCTGTTAGGACAGTCAAAACCAGCATCTA TAGGTAAT SlH f sea 
CAAC J G ^ CCBTCTCTGTTAGGACAGTCAAAAC CAGCATCTATAGGTAAT cjoll a_Tsea 
CAACTGTTCCGTCTCTGTTAGGACAGTCAAAACCAGCATCTAtAGGTAAT M6b!aT_ !seq^ 

T T A A A T.A T T T T T T C T C C A A A G A G T.T n T C C A T A A T A A T C A T T A A T C C C A C r. u a 

310 320 330 340 350

•T T A A A T AT T 11 TT C t" C > t A jr! ° 1 1 FTP ATAATAA'TCATtAATCCCACC 2603_aA2.se, 
TTA-X ATAt^??? 0 ? 00 A A ACAG H C I CG ATAATAATC ATTAATCGCACG ISr^UaiCeq 

t^^tTt^t^^ g ? cc m agag .jj c j cg ^^ sis_ a i2^eq 

TTAAJtTATTTTTT-CT'rri I A r i"? V t p * A T A A TC ATT'A AT C CC AC-C cJblU_al_^eq. 

TTAA.ATATTTTT T, C T C C A A A G A G T T C. T C G A T A A T A A T C A T T A A T C 0 C A C G _36b_kl2.se<J 

■A T A A C G T T T T T T C A T A G G A T A A TT G T A T C AC A ATT f T A A C T A AAATAACC M a w^ 

' ■•• ____________ ■ • 370 ■ 380 390 "loo 

•A? Aa" C G?T???T CA ? AGGA i AAT .T GTA TCAC AA TTTTAACTA.A.AATAACC 2603.ai2.s_, 
AT AA CGT?TIT T r.? ACGA I AA H G I A J CACAAT T TTAAG TAAAATAA.CC IS^Lal^eq 
A 1 AACGTTTT TT CATAGGATAATT G T ATCACA ATT TTAAC T A AAA T A A r c -to ™~ 

351 ATA A CGT.TT.TT TC A T A G G A T A AT T.C T:A T.C AC A.ATTTT A ACTA A AA T*A A C C clbTll ad2^sea ' 
351 ATAACCTTTTTTCATAGGATA^ * 

TC-AC TACTA CAA T-A AAA CTAAA A.AAGATTCGAACCTCAGTTAGTCCCAAT Majority 
410 420 > A-m aak 7H 



4 ?° J 430 440 . 450 

— * ■■ i t . 



I 1 ~- . -__HJ 

TCACTACTACAATAAAArTAAAA AAGA TT GA 2603_al2.se., 
TCACTACIlCAiTAAlifTMiw I8rs21_al2.seq 
i CAC Z AC AT ACTAAAAAAGATTGGAA C g_T C A G T T A G T C C C A A T SIS aI2 sea 

TCACTACTACAATAAAACTAAAAAAGATTCCAA C _ «» 1 ___<« > U C. T_9 C C A A To bTl aTIV, 
TCACTACTACAATAAAACTAAAAAACATTGGAAC Cmcn " cTCCcl" fa^al^^T 






460 4 t° ~ s; 






— T- 1 _j . 500 



« c TTT „„ tacttca -;»---„cc„,octj^ 

CCAGTTAGATTCAAAATACCATAACCAAGTATA ^ AACCAr.rTAAAACATr toJority 
S ?° 5 ?° 530 s ; 0 So 






TCTCCGA AAATC AACCCCT A G G T A 
i i 

560 570 




— , — i 

AAA cjblll_ai2.seq 
A A A A h36b>_al2.seq 






f C AC AACACTTT CCATATACTCTTAGCf Hajorlty 
ZiZ V • 630 64 0 .650 

mr ™aVcaaacccaa 4 *tacctt^ 

mi AGTACTGCAATAAAATAATAATAC.TCCCAAA T ATCATAAATr.TTrr w.^, 
660 .670 ftftn '' IJI : " '- : ~T 



690 7Q0 



Ca.TCGAOTG CCC ACT-COCA A A.CCA AT A G C C A C P. T r. c A A A T A n T A A_A TGCC Majority 
7 ?° 7 ?° 730 ■ t;40 7 s 0 . 

— ^I — CCACCTGCAAATACTAAATGGG 2603_al2.s«i 
ACGAATAGCCACCTGCAAATACTAA A t r. fi n io„,, 



5 F^Il Jxt.ta C T C T J T G A A A A A T A tj? TTTAA ACAA AC t ^ ^ CAT. m^i^ 

751 D T A A A G T T G G T C T T A C T C T T ¥ C A A A a a I it 5 I I I I M U A A A G T A 'T A C A T 18re21_.ai2.seq 

T C * T T 0 A A A A T A A GT TTT AAA G A A A CT A T X C A T l4bV^:^eq * 



• . 1- - « « o. wr * *j. * XI V,./V * IMOO^aiii 

- A .T A C C A.G A G A T A A T A G C* A y T T A C T-G C" G AT1.AATH T A c'-C T T c'a fi'CAT-ACC -A 

.% -A T A C C A.G A G^A T A A TA G C A TIIA CTt C G A T AAA ^ ^ ^ ^ ^ "l'T A C ^ A^ T jl £^ ^ l^l^^ 3 ^ 
B0, ATACCAGAGATAATAGCATTTACTGCGATA'AAiC' T A G fc T T G A G G A- T A C C A •M^^lZ^seq 6 '' 
CTTCTTAAGGTA A.C A C A A. A G T C A C-G C T C A T A AT f^ G CAATAGCTA T*C TCCC- Majority 

!2! __ **° . 8^o Z> 

CTTCTIAACCTAACACAAACTUCCCTCUiJ^^Jl^^^^^S'? 



"651 
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ipnr 






TTTGGCAACTGTCCTCTAACACTTTCTTGAATAGTTTGGTCAAATGAAAT Majority
910 920 930 940 950 960 970 980 990 1000



TTXGGCAACTGTCCTCTAACACTTTCTTGAATAGTTTGGTCAA'AT cH A A T 1^^^ 
OGCAACTGTC. CTCTAACACTTTCTTGAAT.AGTTTGGTGAAATGAAA'T h36b.a72.seq 

TACACTGTCCGCCCAATATTTCATCACCAATCCTAA A CTCAAAAATA A r. A Majority 

1010 1020 1030 1040 1050



-ca-g^g'gg^a^^ 



TAA.TAGC A AT AAATGCTT 






1060 



AATAACTTTACT ATTTTGACGACATAACATT Majority 

1090 1100 



1070 



1080 



TAAT"AGCAATAAATGCTTG^ATAAGTTTACT A-T TTTGACGAGATAACATT fsr-si^ 12 " 3 ^ 

A AUCAATAAATG.eTTGAATAAGTTTACTATTTTGA"PCAr A T A t c a -r t *^T. , TT* 
T A A.T A G C AATAAATC£T TCA'A T A A^C T T T A C T A TTTTfi A C -*G AG.ATA AC ATT Mefc^alzlseq 6 *'' 



A G T € T T T T T.A T A T C T T T C T A A T A T T G G P 
1110 ii3o 



A A A C A AGCCAC.CT A.'A CTTACAT Majority 
11 30 1140 uso 



ACTCTTTTTATATCTITfcTAAfAT'Tf^ 2803^12. sec, 



agaaaacaat cgaaattaaaattccctc'a 

1170 1180 



1160 



ACGAT ATTAAATGGAATAACC Majority 






1190 
— i , 



1200 



AGAAAACAl?cr AAA ??MMn^ CTCAACGATATTAAATCGAAT AACC 51S_al2. S eo 

ATTGTTAAAACGTAATTGCCTACACCAATAAATGTTCTGATATCAAACTT Majority
1210 1220 1230 1240 1250

MI J II M A A G C T A A T T.G G C T A C A C C A ATA AATGTTCTGATATCA AAGTT 2603 *!? ^ 

a i ioii AAAAGGTAATT G.C C T A C A G C A A T A A A T G T T C T C A T A TC A A A r tt ^tuftt 7* 
A T T G.T T A A A A G G T-"A A T T GO C T A C A C GAATA A A *T G T T C T> £ J.A £ pi A A G 

A C C A A AT AT AG C AT A C A A* A' C G A A T C CCA A A G A C A T AG T T C A G A G'C T A C'C A.' Majority 
1260 1270 1280 1290 1300



A G C A X 'A T A T^' GCATACIAAfiCA'lTCr'r i * G-A*C ATAGTTGAG'AGp TA'CCA SlSLjalil^seq 86 ^ 
A'-C C A A A T A T A G-C ATlr 1 » irp i t'J S S • ^ A C A T A G T - T PAGACCTACC'A cJblll^lTseq 

AGCAAATATAGCATACAAAGGAATCGCAAAGACATACTTCAGACCTACCA MSb^Ti.seq 

TACATACGGTCAAGCTAACXCTACCAAA.TAGA^TA CCTTTAATAAAATnT 

13 -'° »330 ■ , 1340 ( i^o 

T A tATACCC "AACCTAACTGTACCAAATAGAC TACCTTTAATAAAATfT , Km 
T A-G ATA C G .CTCAAGCTAACTBTACCAAATAGACTAfiCTTTAATAAAATCT ££tt I \T 
T A G A T .A C H| GTCAAGCTAACTGTACCAAATAGACTACCTTTAATSAAATrT £?f ,o 
IAGATACCGTCAAOCTAACICTACCAAAT "i n, A A £ A A A A £ C £ c bm a ,? 
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1351 
1351 



1380 1390 1400 



*CCTACACCAACCATA TTC ATCG C TAAACCAATAAACCTTTCTCCACn-Afi Majority 
' 4 . 10 »«0 1440 I4SO 

CATTACCAACTATAACTTT T:A AAAGTCATCT T AATAAGAGTACACCATA A u*,„,*„ 

J£2 iyo . 149 Q 

ot?Jtcc^S?A^AACT???M^?JS!JSJIf AT * AGACTApACCATA * 
CATT^^r AA r^ A ^ AA ^^^I AAACTCATCTTAATAAGACTACAC CATAA5^S alZ.seq 
GATTAGCAAGTATAACTTTTAAAAGTGATCTT. AATAAGAGTACArrAXA A ,-ik77. 
CATTACC AAG"T,ATAACTTTTAAA AG TGATCTTAATAAGAGTACACCATAA ftS^^aT^jSeq^ 

CTTCATTTCAAATCAAATAAAATAA.A A-C C A ACTAACATCCGA A G ,C ATTG A Majority 



1510 1520 1530 1540 1550



CTTGATTTCA^AATPAA ^Z^^^^^^^^^^^ ^^A^^^-^'*-'^ GAAG'GATTGA 2603 al2,seq 
TSOI C^Gi^J^^^M!^!M TAiAA G ^" A ACT AACATCGGAAGGAtTGA 18rs21 al2^L 
IsO C??feJ???? A A A?r a til iii - I a A AAG H A ACTAACATCGGAAGGATTGA 515 ali.seq^ 
C??GA^?r AAA ^ AAA J A M ATAAAAG cjblilj^seq 
C T T G A T T T C A A A T C A A A T A A A A T A A A A G C A A C T A A C A T C G G A A G.G A T T G A liebJSi.s^ 

AAAATCAACCTTTAAAAATTCTGCTCCtGG T ATTAATGGAAATGAAAnCA Majority 
15 . 6 ° a5 . 70 ; ■ . *580 1S90 7^0 

AAA A T ^ A- A C C T "T T A A a A A t I r t'^ ^ ^ ^ T A T T A A T C G A AATCAAACCA 2603.aI2.seq 
IaAA^CA^C^AaM ISrsit alSeq 

AAAA£r AA rr£Tx AAAA ^ a!2.seq 

AAAATCAACCTTTAAAAATTCTGCTCCTGGTATTAATGGAAATCAAAGCA hSSb.al^.seq 

TCATCAATACAAAAGATAAGCCAGAAA G AATGGCGATTGTCACCATTTTA Majority 
- 16 , 10 ^ 163Q 1640 ^ ^0 

J™ •3^ tcaa taca.aaagataagccagaaagaatcgc cattgtcaccatttt1 26m at? _ 

160 1 TCATCAATACAAAAGATAAGGCAGAAAGAATGGCGAT GTCA ATTfn » ,* 

IS! ! catcaat a^aaaacataaggcagaaagaatggcgattgtca^ 

! S TCAATACAAAAGATAHGhGAAAGAATGCCGATTbT A nn cf bll 

1601 tcatcaatacaaaagataa.gcacaaagaatggcgat'gJcaccu 

CGTGTATTTGTCATA. AAAAAA TTCCTCCAAT T T A A A T A'A A T.T G A A A C A A G Majority 
- 1660 , 1670 1680 1690 !^G0 

C6 T GTAT^TT^G'TC A T A 1* A a^ a a " A «r ^ ^ ^ ^ A A *^^^ AAA T A A A *T T G A A -A .G. A A G 2603 ai2.seq 
\ts\ CgIcTAT HI VAt ^ A L A A A J ^ cc TCC A' AT I T A A "A T A A A T T g! M jtf £ A C 

1651 C C T G T A t-T T G T C A T A A A AAA A T TC C TCCAA T TT A A AT A A A T*T G A A AC A A r "SrS 1? 

A651 ^ GGTfiT-ATI T'.QT^ C A T..A A A ..A AA A T T*C t-T'CCA A^ T T *T.A/A A T A*£ A'TT-fi A A AG A A G WBb^alSls^ 6 ^ 

C T.C G A A A C C t A A G C G T A T G T A C G C C A A A A A A - C C T T.T~C T C T T C T C C C A T C MaWl^ 
1710 1720 * 1730 " •■ 



, -y - 1740 1750 



I™ C^CAaIg^ta t^r^?^^ 
.™J CTCCAAAGGTAAGCGT A T.G T.A C G C C A A A A A A - C C T T T G T C T T C T C C C A T C lWl »iHL, 

™J r'x r^'^^^S^ TAAG< ' GT A ^ CIACC.C'CAAAaA aQ C C T T T GT C TT C TC C C A T C 5^1^^ 
1701 C TCC A A AC G T A A G C G T A T CT A C.G C G A A A A A A-C Ctt TO TC TT C T C CC A T r 
170t CTCCAAAGGT AA G .C G-T ATC t ACCC0 A A A A A A - CCTTI CTCTTCIC CC AIC Meb'aTSlseq*'' 

CAGACTTTACTCTCGCTTGTCC A A T C T C A C C A n A T C A C C T T T C C C T C C C G Majority 
»760 ,770 1780- 1790 7^0 



I 750 CAGA CTTTACTGTCCGTTCTCCAATCTCACCAC ATCAGCTTTCGCTCfrr ?fim 

1750 CASACItTACTSICtSTICICCAAICICACCACATCACCTTICCCTCCCC 

CACACTITACIGTCC.CITr.Tr.c«»TrTf i^,;;;^'„;".rrI? GC0 18rs2I_al2.seq 



17S1 



CAGACTTTACT.GTCG.GTTGTGCAATCTCACCACATCAGCTTTCGCTrrrr ° 

1750 CACACTITACTGTCCCTTCTCCAATCTClCCAaKU^^^J^CcKv' 
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me^Wwtth Weighted residue weight table. 
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TJGGAAGCGATTACCGCCGGTC 



T 

1830 



Majority 






1840 



I8SO 



CACTCATGCTTCACAA CTC AC AAATAAG 

P C "Vi a D gO 5 ..j^t^fH^ 

CACTCATGCTTCACAACTCACAAATAAGTTCCA ACCCATTACCCCrrrT?- ■>, ,-, 

GACTGATCCTTCACAACTGACAAATAACTTCGAACCGAT^AtcCCCrr^r TH '.T* 

GACTGATGCTTCACAACTGACAAATAAGTTGGAAGCGATTACCGCCrrTr ST*^'-** 
CACTCATGCTTCACAACTGACAAATAAGTTGGAACCG^^ACCGCCCG^r 

CACTGATCCTTCACAACTGACAAATAAGTTGGAAGCGA^TACCGCCCG^C 25^2^ 



CGG-AATTACACCCTG C C 

i — ,. 

1860 



C T G A 






1870 

_I_ 



AGACACCTATAGCATAACAAAAAAA ACTT 

• ' — l r 

1880 1890 



Majority 



190O 






jjjp«»ii»sMH»Hiim»^ 

GCAATTGCAAGTTTTTTAATTACTAATTAGTAGTAn T GATTAAAAATfTAT „ aJorlty 

121° ^ 1930. 1940 • ^50 

GCAATTGCAAGTTTTTTAATTACTAATT^r 2603_ a .2.s eq 

ATTAATACCA AATTACTATACTGTATCGTTtCT T TC AGATTTGCTATTTT Hajorlty 
" W70 1980 i 990 



1960 



■ J • ■ , — zuoo 

a ^ Z t A T * C C AAATTA CTATACTGTATCGTTTCT TTCACATTTGCTATTTT ,» 

attaataccaaattactataCtgtatcgtttctttcagatttgctatttt 22^ 8 ;f q 

ATTAATAC€AAATTA C T A TACTGTATtGTTTCTTTCAGATTTGCTATTTT 55? ^ 
ATTAATACCAAATTACTATACTGTATCCTTT^TTTrJrJTTTrrT^II S15_ai2.seq 
A T T-A A T' A C C A A A T T A C T A T A C T G. T .A T C G T T T C T T T C AGATTTGCTATTTT MSbualzlseq*'' 






TAGTTTTTCT-TAAAAAGAT-AA A C AAAATTCCC A- A AA T A A.TAC AACC A A n A. Majority 
'~ 2 °.'° _f°f°_ 2040 ~~ 

^g^^^^^ A . A ^^AAACAAAAT TCCCAAAA. T AATACAAC.CAAGA 2603 a i2. seq 

^A^ ^ '^ CCCAAAATAATAC'AACCAAGA SIS 3 ^!^^ ^ 

TAGTTTTTCTTAAAAAGATAAACAAAA.TTCCCAAAATAATACAicc^rA 1 
TAGTTTTTCTTAAAAAGATAAACAAAA T'T CCSCAAAATAAT. ACAA CCA A G A MHbusGslseq*'' 

A T T. C T C A G T C'C T C.C -A C C AATAATC A TTCCTGTT T TA GGA A G AAA T.G A T T G Majority 

2£ g* 70 . 2O80 2O90 

ATTGT?AC?cc?cCACCA^^Jr^J^L CTTIT * 6GAACAAATe " 18^12^ 
Axiuii.A.i»icCiCCACCAATAATCATTCCTGTTTTAGGAAGAAATrAT-r/»«rte To 

ATTCTCACTCCTCCACCAA.TAATCATTCCTGTTTTAGGAAG^A?G^?r ^ n f ? 
ATT6TCAGTCCTC CD CCAATAAICATTCCTCXTTTACSAACAIATGATIC SS^^T 

TGCAAAAAC. CGCT T GT GAT G G T T T A G C A T T T G T T C C T C G A G G A G T T T CT T Majority 
2110 • 2 _if° 213? .2.140 zTso " 



i — : i : : -yy 2.140 zigg 

I!^ liA ' A 9 C C 9 T T G T.G. A T G C T T T A C G A T T T C T T G G T G G A G G A C T T T C T T 2fim <=~. 

TOGA A A A, A G.C G G TT G TG A TO tTITA C G-A TTf'c TT r r -r r r i r * V * t ii ~ I 2603_al2.seq 

W G A A A A. A G C G G T T G T G A T-fc G T T T A G G A T T T G T T r ; r t r r *>t JtS ISrs^U^.sec, 

TGVAAA»AOCCGTTfiT«IfiCTr.ACGAT,?cy?E?^ 

T G G A A A A A G C G GT.T G T G AT C G.'T T T A C G" A T'T T G TT C CT C C A G G A C T.TTCTT 

T T TG C T T T TC TACCTC TACT T C C T CT C T T TT A T T A r. C A ACTA G A r. C. A A r..t uL, f 

.TTTCGTTTT^TAC CI CTAC'TTdc.T GT G T T T T ATTAGCAAC T A C A G C A A C T W^Zi'^ll 6 *' 
l?TPG% T TT T T.CTl^^.r^l^ 

TT.TCGTTTTCTACCTCTACTTCCTCTGTTTTAT^ACC^ACT^CAGCA^ 

ACACCATCCTTCATAGATATACCGTAACCAf.TT A GTGCTTT-TGCTT.CTCG Majority 
^ 2230 2240 2^0 

ACACCATCCTTCATAGATATACCGTAACCACTT AGTCCT T T T GCTTCTfr ' ?ftn^ 
ACAGCATCCTTCATAGATATACGGTAA C'C AGTTAGTGCTTTTGC TT r T r r ^ t. ^ 
ACAGCATCCTTCATAGATATACGGTA-ACCAGTTAGTGCTTT TH CTTCTCC ,f ^ 

ACAGCATC-CTTCATAG'ATATACGGTAACCAGTTAGTGCTTT T G CTTCTrr olf M 
ACAGCATCCTTC A'T AGATATACGG T-A ACCAGTT'AGTGCTTTTG C'T TCTCC M6b aTz seq^ 
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me5T8avw 
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A A A A A 



X A» ir6 T t < 



2260 



2270 



TCAACCTCAATTTCTCCCTTAT Majority 



2280 



2250 
22SO 



2290 



AAAAATATACTTACC 
AAA 



2300 



AATATArrT A Vn*^IMI-M CCTTCAACCTCAATTTCTCCCTTAT 2G03_.ai2.seq 
AATATACTTACCAGGTAATAAACCTTCAACCTCAATTTCTCrrTTAT ift^i _ 
^ 4 a » * * TATACTIACC»CCTAATAA*CCTTC*ACCTCAATTTCTCCCTTA A8rs2I_al2 .seq 



225o aaaaatatacttaccaggtaataaaccttcaacctcaatttctcccttI? ^IhT?! 2 :!? 1 

^ AAAAATATACTTACCACCTAATAAACCTTCAA^ 

CATCAGTTACTAA TGAA GTAATCCCATCT TCA TCGCTCCTAAATCGTCCA Majority 
23 . 10 2320 2330 2340 2350 






CA^CAG^r^A^ 2603_ai2.seq 
CATCAGTTACTAAT.GAAGTAATCCCATCTTGATCGGTCGTAAATCGTCCA 515 a 12 sea 
CATCAGTTACTA AT G A AGTAATGCC ATCTTGATCGGTCCTAAATCGTCC A h36b_al2.seq 

TTTTTAAACCGAACTGGCTGATTCTGG T TATCGTATAATACAAATATTAC Majority 

^ 2380 2390 2400 

T^f^Hj^^ 2603_ai2.seq 

m^AAAG^ r A Ar?r^ 18rs2l_ai 2 .seq 

i??jj aaa "® aa "s^j^ ga ^ si5_ai 2 .seq 
T^T^T^ AA cr£ AA ^r G ^ ^11^^ 

TTTTIAAAGCGAACTGGCTGATTCTCGTTATCGTATAATACAAATATTAC .h36b_.al2.seq 

TCCGGATAGCC TTTTCTTT ATCTTTCCTTCTTTTGTATATTTAATAAG-TT Majority 
. 24 , 10 242Q 2430 2440 2450 

TCCGGATAGCCTTTTCTTTATCTTTCCTTCTTTTGTATATTTAATAAGT T 2603 ai2 sea 
TCmATAGCGTTTTCTTTATCTTTCCTTCTTTXGTATATTT A A GTT 18r5? I'lTL. 
TCcBGATAGCCTTTTCTfTATCTTTCCTTCTTTT T T TT Sl^lif^ 

T " CG-ATAGC. cQt TTC TT'TA-TCTTTCC T'T C T T T T g|S A T A T T*T AATAAGTT teellljl 

TTAATCCCCCTGTTTCAACTTTTCCCT T AGCATTTATCTGTAATTCATTT Majority 

2460 2470 2480 2490 2500



TtAATCGGCCTGTTTCAACTTTTCGCTTAGGATTTATCTG.TAAT.TGATTT 2603 a!2 sea 
T T A A T r f* r r «I5 JH5 AACTTTTCGCT.TAG G AT TT A'T CTGTAATTGATTT 18rs21^al2Vseq 
IJ AA ^5^ CTGTTTCAACTTTTCGCTTAGG ATTTAfCTG.TAATTGATTt515ai2 seq 
TTAATCGGCCTGTTTCAACTTTTCGCTTAGGATTTATCTCTAATTGATTf clbUl ilTsea 
TTAATCGGCCTGTTTCAACTTTTCGCTTAGGATTTATCTGTAATTG ATT T h^blaT^seT 



GATAACTTATCATCTGGTAATTCAATATAAAAAGGTACTATTGTTGAAAC Majority
2510 2520 2530 2540 2550



GATAACTTATCATC T G GTAATTCAATATAAAAA'GGTACTATTGTTGAAA C 2603 ai2 sea 
GAT A ACTTATCATCTGGTAATTC A A T A T A A A A A G G T A C T A T T G T T G A A A G SfaifSeq 
GATAACTTATCATCTGGTAATTCAATATAAAAAGGTACTATTGTTGAAAC SlS^aii sea 
GAT A ACTTATCATCTGGTAATTC A A T A T A A A A A G G T A C T A T T G T t G A A AcS f^sea 
GATAACTTATCATCTGGTAATTCAATATAAAAAGGTACTATTGTTGAAAC ^lallls^T 

GCTTTG ATCAGCTTTATAA.GCAC GA C.CAAAGTACGAACCATTTGGGACTG Majority 

2560 2570 2580 2590 2600



GCTTTGATCAGCTT T A TAAGGAC GACCAA'AGTA C-G AACC ATTTGGGA G T G "2603 _ja!2 sea 
GCTTTG ATCAGCTTT.A,TAA.GCACP ACCAA AGTACG AACC ATTTGGG A U S 111 sea 
rrll Ir A I * AG ° T T ' T - A T A A G C A C G A C CA A A G T'A-C GAACC ATT T G G G A G T-G J£g ^ 
r r HI'S A I GAGGTTTATA . ACGACG A CCAA AGTACGAACC ATTTGGG AG-T ^seq 
G C TTTGAT.CAGC T T T A TAAGC ACQ AC-C AA AG TA C G A A CCA T T TG G G A G T G d^lz.s^ 

CTATCTTTC TCTGAfcCATTAG T A T C AG TAGCAG.AAGTC AAGATA C T C TT A Majority 

26 t 10 ' 2620 . 2630 2640 . 2650 

^ A ^II TCTCTGACCATTAGTATCACTA G G A GAAGTCAAGA.TACTCT. TA 2603.jai2.seq 
^ A I G III GTCTGACCATTAGTATC ACTAGGACAACTCAAGATACTCTTA iars^aSeq 

A I G I TTGT<: I GACC AT TAGTATCACTAGG AG AACTC AACATACTCTTA 515 ai^seq 
rTiTrTTTr 1 ^ ''^^^^ ^ Tj, ^ GAGAA ^ T C\AA GATA'CTCTTA cjbti l_a ^seq 

CTATCTTTG.T C TG ACCATTACTATC AGTAGG AGAAGTC>AGATACTCTTA h36b_.al2.seq^ 

TA CTTCTG GT.TC A A TTC GC TATCTGTC ATTTCCCTC A AT A A ATCAA CTTT Majority 
26 , 60 2670 2680 2690 2700 

^^ C ^^^^ GGTTCAATTCGCTATCTGTCATTTGCCTCAATA AATCAACTTT ISrsil al2^eq 

i*^33^3^^ T ^ CAATTCfiCT A T CTGT CATXXGGCXCAATAAAXCAACTTX SIS aii.seq ^ 

^jn^^ TTCAATTCCCTATCTGTCATTTGGCTCAAT AAATCAACTTT cjbll l_ai2^seq 

TACTTCTGGTTCAATTCGCTATCTGTCATTTGGCTCAATAAATCAACTTT h36b a!2 .seq 



FIGURE 19E 



T * * G T llfiPj V jjjftj | ItifZH Yfi C :| i_f *»»nCCCCICCT CTCIAC UaJorT^T 

'«'^27io' ' "» £j2&"" '•■«•»« » "'2730 2740 






^ TAAGTTGTrArTr a r AG ^ GGA ^AAACGATAAGAAATCCCCTCCTCTGTAG I8rs21_ai2^eq 
If!!!I CTCAGTCACAGTCCATAAACCATA ^ CA AATCCCCTCCTCTCTAC 515 ai2 seq 

2700 TAAGTTGTCAGTCACAGTCCATAAACGATAAGAAATCCCCTCCTCTGTAG b36b__al2_seq^ 

TATTrGGCTGAA GTCC TATCTGTGTGATTGTTAGTT.GATTAGGGCTATCA Majority 

27 f 60 2770 2780 2790 2800 

l^lll^^^l^^ GTCCT ATCTGTGT ^ ^^^<^^^ AGTTG XT TAG GGGT AT C A 2603 ai2.seq 

I A I I * ^ ^ G A A G T C C T A T C T G J ^ * i *eq ^ 

J A IIJ^H GAAGTCCTATCTGTGTGATTGTTAGTTGA cJfaTll al^seq 

TATTTGGCTGAAGTCCTATCTGTGTGATTGTTAGT. TGATTAGGGGTATCA hsebjl^ 

CCATTTACACTGGCTACCG AAAAAAACGCTAATTGT AC CAATCCTAAAAA Majority 
. ggg . jagg 2830 2840 2850 

Jf^^^^ 2603_al2.seq 

^tJTTACACTG^CTACCGAAAAAAACCCTAATTGTACCAATCCTAAAAA 18rs2i al2.seq 

^ A J^ A ^ A ^ TGG ^ TACCGAAAAAAACGCTAATTGTACCAATCC TAAAAA 515 ail.seq 

•GCATTTACACTGGCTACCGAAAAAAACGCTAATTGTACCAATCCTAAAAA cfblH a 12 sea 

GCATTTACACTGCCTACCGAAAAAAACGCTAATTGTACCAATCCTAAAAA ^ILLlt^ 

G C AAC ATAGTAGAAGTC CTAAACTTTTTCTAATCTTTTTCATTTTTGATT Majority 
gggg , . 2870 2880 2890 2900. 

Tj^yyT^TTT!^^ 2603 ai2.seq 

^M^ A J A 5 TAGAAGTCCTAAACTTTTTCTAATGTTTTTCATrTT TGAT^ 18rs21 aiHeq 
^^ AA ^ A ^ A A '^* CTTTTTC AT IT T T G A T T 515 a 12. seq 

• ^^^^ A Z^l AGKAG l CC T^^^CTTTTTCTAATCTTrrrCATTTTTGATT^blll a 12 seq 
GCAACATAGTAGAAGTCCTAAACTTTTTCTAAT C T TTTTCATT'TTTGATT h36b_al2^se^ 

TCCCTTTCTC TT TCTCTCT TT A AATTTTC GTTT T A AATATAATAGT A AAC Hajorlty 

29 , 1Q . : 292Q 2930 2940 2950 

ZrZrlllZl t T TTCTCTCT TTAAATTTTGGTTTTAAATATAATAG T A A A G 2603 a 12. seq 

l^f^ZZZnZ^Z T - TC ' TCTCTTTAAATTrTCDTTTrAKArATk ^ rAGTAAAe 18rs£l__ai?2eq 
I • ' I TCTTTCT ^ CTTTAAATTTTCGT TnAAATATAATAGTAAAG515 at 2. seq 
TrrrTTTr ^TTCTCTC T".T TAAATTTTCGTTTTAAATATAATAGTAAAG CJbTll^seq 
TCCCTTTC tH TTTCTCTCTTTA A' A TTTTCGTTTTAAATATAATAGTAAAG h36b_alS.se^ 

CCACTAATATAAGAAT AACTAGGATTfiATAACAGCAAATAAACTTTATAC Majority 
2970 2980 2990 3000 

C ? AC * AATA TAAGAATAACTAGGATTGATAAGAGGAAATAAACTTTATAG 2603 ai2.seq 
^c? GGAG I AA ^ ATAAGAATAACTAGGA TTCATAAGAGGAAATAAAGTTTATAG ISrsil al2.seq 
ftll GGAG J AATATAAGAATAACTAGGA TTGATAAGAGGAAATAAA.GTTTATAG5iSai2^ 
foS GGAG ^ AA J ATAAGAATAACTAG GATTGATAAGAGGAAATAAAGTTTAT.AG cjb7u_al2 seq 
2950 CGACTAATATAAGAATAACTAGGATTGATAAGAGGAAATAAAGTTTATA.G h36bju2.seq 

TCTCTTTCC A A TTCTTTCAT TAAATAGTTC'TTTTCTTTAACAGCAGCTAC Majority 
™9 3020 3030 . 3040 ' 3050 

"25? I n I *? III n° AA TT G T T T C ATTALA T AG T TCT TTTC ttt A A C A G G A G G T A C ISrs^Jail^eq 

151 G 1 TTG CAATTCTTTC A TT A A A T A G T T. C T> T T T C T T t A A C A G G A G G T A C SiS^aifseq 
^ IS 151 J TGCAAT-TCTTTCATTA A ATAGTTCTTTT.CTT t A A C A G G A G 6 t A C cJblU ai^seq 
3000 TGTGtTTG C A A T T C TT T C A T T A A A T A G T T C T T T T C T T T A A C A G G AG C T AC ia^a7 2 .seq 

AT A C T T GAT T -C G A T G C C C T € T A A C T A G T A A A C G ATCT'G.A AT T A A T C G A A T Majority 
• 3 y° ' • 3070 3080 . - -3090 v * 3100 " 

3050 A T ACT TGATTCG-ATGCCCT C T-A A C T A G T A A A C G A T G T G A AT T A A T C G A A T 2603_ai2seq 
.3050 -AT AC TTGATTCGA. TG CCCTC T A ACT. AGTAAACGATGTGAATTAATCGAAT 18rs21^iHeq 

3051 ATACTTGATTCG A~T GCCCTCTAACTAGTAAACGATGTGAATTAATCGAAT 515 ai2 seq 
3050 ATACTTGATTCGATGCCCT C T.A ACT AGTAAACGATGTGAATTAATCGAAT cjblll ai^seq 
3050 ATACTTGATTCGATGCCCT CTAACTAGTA A ACGATGTGAATTAATCGAAT h36b._a_2.seq 

AAGGTGT AC ATGTTAGCAAA.CTCGC.ATAATCCTTAC CTTTAACAACC AAT Majority 
3JU0 3120 ' 3130 3140 3150 

AAGGTGTACATGTTAGCAA.ACTCGCATAAT.CCTTACCTTTAACAACCAAT 2603_ai2.seq 
AAGGTX5TACATGTTAGCAAAGTCGC AT AATCCTTACCTTTAACAACC AAT 18rs21_al2.seq 
AAGGTGTACATGTTAGCAAAGTCGCATAATCCTTACCTTTAACAACCAAT 515 ai2.seq 
3100 AAGGTGTACATGTTACCAAAGTCGCATAATCCTTACCTTTAACAACCAAT cJblU al2 seq 
3100 AAGGTGTACATGTTAGCAAAGTCGCATAATCCTTACCTTTAACAACCAAT h36b ai2 seq 






FIGURE 19F 






A ATT T 



[ Aj,.g |U.|.A A |T X A T C J„C C C T T7 / 






ACTT A TTTCATCAAC CTTAT.A Majority 
'3180 3190 32oo 



JJS ^ TTTAC AAAAATTATCTCGCTTTACAA CACTTATTTGATCAACCTTATi 

2« AAT^ACA^A^^t^ TCGCTTTACAA ^ 

AATTTAGAAAAATTATCTCGCTTTACAACACTTATTTGATCAArCTTA'TA 
315.0 AATTTAGAAAAATTATCTCGCTTTACAACACTTATT^ K J 

91.50 aatttacaaaaattatctggctttacaacacttaJt^^cIIcct^aJa 






2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq

CGCTAAAACTTCTTTGATATTATGAATATAAAA A ATTTTTCCTTTTTTAA Majority 

32 . 10 32 . 2 ° 3230 3240 ^0 

GGCTAAAACTTCTTTGATATTATGAATATAAAAAAT TTTTCCTTTT'TTA 1 ' , C n-» ,, 






GTTTATCTAAATCTCTAAATAACTTAGCT.TTAGC T AAGCCGCf, ATGACCT Majority 

3 ^ ?5™_ 3280 3290 ~ So" 






mi 1AAAU1AAA * CT GTAAATAACTTAGCTTTAGGTAAGrrcrrATn<ir^T *-«r- ,T M 

GTGATAACAGTATGTCAACTTTTTCCACCAA T TGGCAAr.GAGCTTr.nTTr Majority 
3310 3320 . 3330 3340 ^50 



1 1 1 ■ t . . t — OJJU 

^I^ ATAAC AGTATGTGAACTTTTtCCACCAAtTGGCAAGGAGGTTCCTTC 260^ ^ 

GTGATAA. CA G T.A T G T G A-A CTTTTTCC AC C.A .A T T G G C A aQg aQg TTCG T'T C M6b_alt fe^ 

AAGGTCTCCTCCTCCTTTTTCAAGAACACTACTG C TACTCCCCCCATACA Majority 
33 . 6 ° 3^70 3380 3390 3400 . 

Aa G GT C £cCTCC?cc???T^ 

TACGT-AATTTTTCCTTGATACACCCTAT-ATCA A TATATCCAATCATTTr; Majority 
. ^ fjgg 3430 3440 3450 

?AGG?AA?.^?^§^^ 

lAGGTAATTTTTGCTTGATAGACCGTATATCAATATATCCAATCATTTCA h36bi.al2.seq 
GC AATCTCAACC ATGTC GGC GTATTC A GC A A T A C CT TTTTTTTCT TTTTC Majority 

.^l 60 3 1 70 3480 349 0 1 3500 

G C A A T C-T C A AGC ATC TC G GC GTATTC A*G CA A'tACCTT T T T TTTCt TTTTC ?fin* <^ 
AATCTCAACC A UTG GGC GTATTC AG C.A AT AO£ T Tt^T'Tt T i t t?t ilrtfaiH 
GBAATCKAAGGArBTG GGC GTATTC A C C A AT AC Ctt TTTTT T C T t T T TC 
qC AATCTCAAGCAtG TG GGC GTATTC AGC UTACCTtTTTTTTCTJn^ 5«S 5?**. 
G C A A T C T C A A. G C A T.~G. T G G G C 6 T A T T C A C C A A T A 6 C T*T T,T TvT.T Y p-T TTTTC ^6buil2*seq^ 

;, A G T A t A G CG A TC TC A T.AC GCGGCTTGG G T C C A GTGTT CT AT TATA A GCTT Majority 

^1 3510 352Q . . 353Q 3540 3550 

3500 AG TAT.AGGGATCTGATAGGCGCCTTGGGT C C A CT GT TCT ATTATAATfTT ^vi ^ 

3500 UTATA.U6ATCTGATAGCCGGCTTGGGfcCA TGinn 

3501 A G T A TAGG.GATCTG'ATAGG.CGGGTTGGGTCCA G T G T T C T A T T A T A A r r 1 1 ^rs21 alZ.seq 
3500 AG TA.TAGGGATCT G"A TAGGCGGCTTGGGTCC-AGT^-TT^T^T^AT^AGC^T Sf^ 
3500 AGTATAGGC ATC, T G ,A. T AG GC GG.CTTGGGTCC AGTGTT C- T ATT ATA AG CTt hSBD^al^^seq 6 ^ 

TTGCTAACTCAAATCGTCTATTAATCTCTT T.AGTATTTAATTTTTGGGTT Majority 
' 35 . 60 ' 3570 ' 3580 * 3590 ' 3600* 






TTCCTAACTCAAATCGTCTATTAATCTCTTTAGTATTT AATTTTTff TTT 9finQ 
TTG CTAACTCAAATCGTCTATTAATCTCTTTAGTATTTAATTTTT GGGTT larazT n 
TTGCTAACTCAAATCGTCTATTAATCTCTTTA GQ ATTT AATTTTTGG GTT l^Sf ^ 
TTGCTAACTCAAATCGTCTATTAATCTCTTTA G T ATTTAATTTTTGGGTT ^ h n 
TTGCTAACTCAAATCGTCTATTAATCTCTTTA gQ ATTTAATTTTTGGGTT hSSb^aT^seq^ 



FIGURE 19G 




TCATTA! 






A -y A f | jjgggg^ e^^ g ty AGCTTTAATATTATAGTACCA Ifainrity " 

36 IO 3620 ' 






3640 



I^ A II A I5 AAAC I TAGTTACTTGA TTATTAGCTTTAATATTATAGTACCA 



TGATTATCAAAGTTAGTTACTTGATTATTAGCTT T A ATATTATAGTAC 



A 

C A 



2603_ai2.seq

18rs21_ai2.seq

515_ai2.seq

cjb111_ai2.seq

h36b_ai2.seq



ATTTGAAATAAAACGATATGAGGTTATCAAA A GACCAAr. TAAGAACAATA Majority 



3660 



3670 



A??£r AAA J A ATA TGAGCTTATCAAAAGACCAACTAAGAAC 
ATTTCAAA T A A A A C G A T A T G A G G T T A T C A A A A G A C eft A C T A A 



370O 



ATA 
A T A 
ATA 



ATTTGAAATAAAAGGATATGAGGTTATCAAAAGACCAACTAAGAACAATA 



CTATCAAGCCTACATTCAT CCATCGATTTAAAA.CGACCGATTTCT 



2603_ai2.seq

18rs21_ai2.seq

515_ai2.seq

cjb111_ai2.seq

h36b_ai2.seq



3710 



3720 



TAACG Majority 






I?I A I S A UCCTAC ATTC ATCC ATCG ATTT A A A A.CGA 



3740 



37SO 



GTATCAAGCCTACAT.ICATCCATCGATTTAAAACGK 



D T A T C A A 

ctatca| 



1 C T T A AG G 

CCATTTAAAACGACCCATTTCTTAAGG 



c c t aQa ttcatccat 
gtat.caaccctacattcatccatcgatttaaaaccaccgatttctt.aagg 

TTTTTCTCAAATTTTCCTCCCATTATGATT C AATTCCTTTTCTAACA'CTT Majority 



2603_ai2.seq
18rs21_ai2.seq
515_ai2.seq
cjb111_ai2.seq
h36b_ai2.seq



3760 



■ 3 ™ 3790 

TTTTTCTGAAATTTTCC TAAC ACTT 2603^12^ 

?T?"c?G A A A llllcrlrWkr A I I A I ^ A T ^ C A A T T C C T T T T C T A A C A C T T iSrsZLalCeq 

TTTTTCTGAAATTTTrr^T r* r t * ATTC A ATTC CTTTTC TAAC ACTT S15.ai2.seq 

IIIII^I^^^IJ^TCCCATTAT ojbUI.ai^seq 

TTTTTCTGAAATDtTCCTCCCATTATCATTCAATTCCTTTTCTAACACTT h36b„a^. S eq 

GCTA A A C G ATTTTTTT.G ACGTTG ACCTTTTA T TA AC CA A ACT A A C C A AGC Majority 
_ ™£H ffg 3830 38 40 ~^~ 5Q 

c^TaTa^FaTtT^ 

GB^A AACGA^TTTTT^r a ^ ? J TCACGTTTTAXXAACCAAAGXAACCAACC ISr^lJaiCeq 

CCTAAACCATt?Vt?J r A ? ® " ® A " T T T T A T T'A A C-C-A- A A C T-A'-A C C A A C C Sl5_ai2.seq 

u t, JAAACCATTTTTTTGACG.TTGACGTTTTATTAAC CA A AT.T A A r r a a r n „tum 

gctaaacgatttttttgacgttgacgttt T a"uc?ua1] a \ A cc^ ^"Itl^T 

AATAATAACTAAAGATATATACAAAAGATATCTATAAATCGTGTTTAAAT Majority
3860 3870 3880 3890 3900



IAGATATCTATAAATCGTGTTTAAAT 2603 ai2.seq 



AATAATAACTA AAGAT.ATATAGAAl 

AA ^ AA I AACTAAAGATATATAGA A " A GATATCTATAAAKGTGTTTAAAT iteSl aizTea 
AA^AA^XJC^^AGA^A^ 

AATAATAACTA A AGATATA cjblll_ai2?seq 

AAAAA1.AACTAAAGATATATAGAAAAGATATCTATAAATCGTGTTTAAAT h36b_aiZ.seq 



GACCGTCTTTTATTAATTTTTCATCAATAGGACCTTTATAACGCATACGA Majority
3910 3920 3930 3940 3950



GACCGTCTT TTATTAATTTTTCATCAA T.AC GACC TT'TA T A ACGGATACG A 2fi0<* at? ^ 
~ \ ~ -A C T - T Ji A T A T .T IT T.CATC AATAfi G ACCTTTATAAI ~ " - 4 



GACC 
G A CCj 



TCTT T 
TCTT T 



A T. T..A ATTT TTCATC A AT XBLg AC.C t-TT ATA AG Cl 
A T T.A A T T T T T C A T C A A TAG G A C C t T T A T A A G G| 



T T A T A A G G G A TACGA Si5_ai2.seq 



A T A C G A cJbllTlai2.seq 
I A TACGA h36b_al2,seq 



T C T C -C CC TTACT A A A AG T C T G TC .fC TAT T C AC CATAA T,C G G G G tVc A A C T Majority 

3960 3970 3980 3990 4000



aIIc * * A A T ° G G G ~ G - A A C T 4 






TG TCCCCTT ACT AAA AGT€tCTGTGT ATTC A| 

I G J| ?^ T T A C T A A A A G ^ 18rs21al2 sea 

T CBC C C C T T A C T A AA A G.T C T G T G T G T A T T a A?C A T A ATCGGG GTGC A ACT li^ZifZf^ 

G CTTAC TAAA AG T C.TG T G T G T A T TG A CC A T A AT t GGGGTGC A ACT -cjbTTl al2^sea 

TGTCCCCTTA. pTAAAAGTCTG-TGTGTA'TTG A'C; C A T-A ATCGGGGTGCA AG'T tft6b_al2^seq^ 

CAATAAGGTTGCATAATCATGTCCAGGAACAACCAACAAATCTGAAAAGT Majority
4010 4020 4030 4040 4050



rMJM'r !I AATCATGTCCAGCAACAACCAACAAATCT ^ 2603_al2.seq 

C.AATAUGTTCCATAATCATGTCCACGAACAACCAACAAATCTGAAAAGT 18rs2~l aiHea 
£ A A I A "TGCATAATCATGTCCAGGAAC A A C C A A C A A A T C T G A A A ACT sl^S se^ 



4000 f|A ATA ACGTTGC ATA'ATC ATGTCC A GGA AC A ACC A A C HAT C TGAAAAGT ^ihil i \, 
4000 CA ATAA'GGTTGCATAATCATGTCCAGGAA'C AACCA AQ AA ATC T G A A A A G T h36b^aT2!seq 



FIGURE 19H 






T A T C G C CT"T^ IBg S il i J B g ^ CTTCATATCCT ATCGTTTCT Ifalorl^ 
4060 4070 4080 4090 4100



-r ^'-r G ^ GG ^ G A ^^ A ^ T ^ T ^ A TCTGATCTA,CTTGATATGCTATCGTTTCT 2603 a!2 seq 

TATCGGGTGTAACGACTTTTATCTGATCTACTTGATATGCTATCGTTTCT ISrs^Heo 

TATCGGGTGTAACGACTTTTATCTGATCTACTTGATATGCTATCGTTTCT SlTlifsL 

J^r^^^M CCACTTTTATCTGATCTACTTCATATCCTAT CGTTTCT cJbUl ai^seq 

TATCGGGTGTAACGACTTTTATCTGATCTACTTGATATGC.TATCGTTTCT h36b_a7 2 .seq 

TTTATGTTTTGAATATAAA A CTTATCTCCTTTTTTTAACTTTTTAAGGTT Majority 
- Al }° 4^ 4130 4140 4^0 

^^^"I^S AATATAAA ^ CTTATCTCC ^ T TTTTTAACTTTTTAAGGTi 2603 ai2.seq 
III!^H TTGAATATAAAACTTATCTCCT TTTTTTAACTTTTTAAGGTT 18rsiT a Heq 
TTT T TTTTGAATATAAAACTTATCTCCTTTTTTTAACTTTTTAAGCT1SI fai 2 seq 
'III !5 TTTTGAATA TAAAAeTt ATCTCCTTTTTTTAACTTTTTAACGTT cjblll aTTseq 
TTTATGTTTTGAAT.ATAAAACTT-ATCTCCTTTTTTTAACTTTTTAAGGTT h36b_a72 . seq 

AGAAAAGAGTTCTTTATC TGGAATTCCTGAATGCGCTGTTATAACAGTAT Majority 

™® 152? 4180 4190 4200 

A /^ A *^ A ^ A ^J^ CTTTATC ^ GGA A * T T c c T c A A T G C G C T G T T A T A A C A G T A T 2603 a 12. seq 
A^rAA'A^r^ rWU TrTTT«I^ G ^ AA ^^ < ' G ^ ( ' ^Jm ^^^^CTGTTATAACAGTAT 18rs21_al2?seq 
ACAAAAGA G||t CTTT ATCTCCA ATTCCTG A jRJ TGCGCTGTTATAA cBg TAT 515 ai2 seq 

AJA A GAGTKTTTATCTGGAATTCCTGaHtGCGCTGTTATAAcIgTAT cjblll aT^seq 
AGAAAAGAGTTCTTTATCTGGAATTCCTGA.AT G'Q GC'TGTT ATAAC AG T A T h36b_a72 . seq 

GTGTACTATTGCCTCCAAT TGGAAGACACG TA CCTTCTAAA-TGCCCTGCT Majority 

4210 4220 4230 4240 4250



GTGTACTATTGCCracCAATTGGAAGAGAG G T A CCTTCTAA AT.G CCCT GCT 2603 a!2 seq 
°*« C T W C T A T T G C CBC CA AT.TGGAAGACAGGTACCTTCTAAATGCCCTGCT 18rs2^lHeq 
^MGTWC TATTGCCTCCAATTGCAACAG AGG T AX C T T C T A A A T GHC C T G C T S15 alijseq 
SIS TAT tH CCTCQAATTGG AAGAG aBg TACCTT CTAAATCCCCTGCT cjbUl at^seq 

G T.G T A C T A T T &C C T C C A A T T C G A A C A G A G C TBC C T T C T A A A T C C C C T G C T h36b_a7 2 .seq^ 

CCTTTAGATAGAAC TTCT TG AG TT G A A C C T G C A A A T AT AG GC AGTTTTT C Majority 

. fgfg .^70 4280 429 0 4300 

C C TT T A G A T A G A A C T T C TKACTTGAACCT GG.A A AT ATAGGGAGTTTTTG 2603 ai2.seq 
CCTTTAGATAGAACTTCTT.GACTTGAAC C.TG CAAATATAGGGAGTTTTTG 18rs21 al^seq 
^ G II TAGATAGAAC ' r T j£ T T. GACTTGAA C,*C TGC-AAATATAG GGAGTTTTTG 515 al 2 seq 
Prin A S A I A - AACT TBT T «ACTTGAACCT.GCAAATATAGGGAGTTTTTG cjblll ai^seq 
GCTTTAGATAGAACTTCTTGACTTGAACCTGCAAATATAGGGAGTTTTTGh36bal2 seq 



ACCTATCTTAGGAACTGAAATTGT 
— p- ■ i — 

4310 4320 



ATTTTTTCACTTACCTCTA AC A Majority 

4330 4340 4350 
-J : : l_ 



A ^I A ^I TAGGA ACTGA AATTGT||CC G ATTTTTTC A CTTACC TC T A A C A 2603 at2.seq 

A ^^ A I^I AGGAACTGAAATT ^ t " GCGATT ^ TTTCACTTACC 18rs5L.ai2.seq 

AGG I ATCTTAGGAACTGAAAT T DX GCCGATTTTTTCACTTACGTCTAACA 515_ai2.seq 

A ^I ATCTTAGGAACTGAAATtGTGCCG ATTTTTTCACTTACCTCTAACA cjblll ai2 seq 

ACCTATCTTAGGAACTGAAATTGTGGCGATTTTTTCACTTACCTCTAAC.A M6b_.aT2.seq 

■TACGGCCGTAGTC T.G G T AG.C C C.C T TT T G A A T T C G TTTTTT C T C A T A A G G A Majority 

.'- 43 i GQ 4370 4380 439 0. 4400 

T A CGG GCGT.ACTCTGCTA C C.C C C.T TT T G A A; T T ,C G T T T I T t CTCAT A AG G A 2603_ai2„seq 
I A S '5 5 G 2 G T A C T C T G C TAG0CCC T T.T T G A A T T C 0 T T TT T T C T CAT A A. G C A iSrsfcl aiSeq - 
TAG GGCGTACJG T.G-C T AHc C C GTT'TTG AA'TTCGTTJTTT C TC* A.TAAGGA SlS^izTseq 
I * S 2 "S5 S 5 I - S * C T G C T A C ° C T T 'T C A ATT C-G TT TflT TCTCA T A AGG A cjblll^a^seq 

TACGGGCGTACTC T'G CTACCC gBT TTTG A AT T CGTTKT f CTCAT A A G.G A K36b ai2 seq^ 



T C T t C A A G A T G G A C A T T A T T.T A. A A G A A T C ATT A T A A G C T T C T G C T A G A G C Majority 
4410. - 4420^ 443G \ 4440 . 4450 



4430 



T C T T C A A G A T G G A C A T T A T T T A A. A G A- A T C A T T A T A A G C T T C T G C T A G A € C 2603 ai2.seq 
TCTTC AAG A T,G GAC A TTATTTA A AG AA T C A TT AT AAGCTTGTGCT AG AGC ISrsSi al2 se^ 
TCTTC AA'G ATGGA C A TT.ATT T A A A G A A T C A T T A T A|GC T.T GT G CTAG A G C 515 a 12 seq 
T C T T C A AG A T G G AC AT TAT-TTAAAGAATC A'T TAT A A GCTT.GT G.CT AG A G0 cJbXil_alTseq 
T C T T C A aBaTG.G ACATTATTTAAAGAATCATTATAA GCT TGTGCTAGA G" C h36b al2.seq 



CATACC TC G 



ATTCATTTCTTTC 
— 1 — — x 

4460 * 4470 



TGACTA AGTTTTTTAGC.AGCTCTCTCAA Majority 
4480 . 4490 4500 



CATACGTCGATTGATT T C TT TC T C A C I A A G TTT TT T AC C A G C T CT C T C A A 2603 ai2.seq 

CATACGTCGATTGATTTCTTTCTGACTAACTTT.TTTAGCAGCTCT-CTCAA 18rs21_al2 seq 

C G A T T G A T T T C T T T C T G A C T A A G T T T T T T A G C A G C T C T C T C A A S15_ai2.seq 

CGAT TGATTTCTTTCTGACTAAGTTTTTTAGCAGCTCTCTCAA cjbl 1 1 ai2 seq 

CATACGTCCATTGATTTCTTTCTCACTAAGTTTTTTAGCACCTCTCTCAA h36b_al2.seq 



C A T A C G| 
C A T A C G| 
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>CT/US2005/02723? ; n 



4510 



4520 



CTATAGTAAAAACGTGATACC Majority 



4530 



4540 



4500 AATCCTGTGTTTGATTA TTAG ATTCTATCCTATAGTAAAA'AC 
45GO A A T C C TGTGTTTG AT TATTAG ATTCTATCGTATACTAAAAAC 

4501 AATCnTGTGTTTCHtTATTACATTCTATCGTA B 
4500 AATCCTGTGTTTGATTATTAGATTCTATCGTATAGTAAA A A C 
4500 A A T C C T G T G T T T G A T T A T tBg A T T C T A T C G T A T A C T A A A A A C 



4550 



GTGATACC 2603_al2.seq 

GTGATACC I8rs2l_ai2.seq 

GTGATACC 5l5_al2.seq 

GTGATACC cjblll_al2.seq 

GTGATACC h36b_ai2.seq 



ACTGCATACAATAAAATAGAT. AGACCTAT T AGAAAAAGAATGATAAAAGG Majorfty 



4570 



4550 
4550 
4551 
4550 
4550 



4580 



4590 



AGTGGATACAATAAAATAGATAGACCTATTAGAAAAAGAATG 
ACTGG ATACA ATA A A ATAGATAG ACCTATT.AGAAAAAGAATG 
ACTGCATACAATA aBaT.A G AT AGACCTATTAGAAAA AGAATG 
ACTGGATACAATAAAATAG ATAG AC CTATTAGAAAAAGAATG 
ACTGGATAC A AT A A-A ATAGATAGACCTATTACAAAAAGAATG 



4600 



ATAAAAGG 2603_ai2.seq 

ATAAAAGG 18rs2I_ai2.seq 

ATAAAAGG 5I5_ai2.seq 

ATAAAAGG cjblll_al2.seq 

ATAAAAGG h36b_al2.seq 



AAGATTTGACTTCTTCTTTTT - 



4610 



4620 



4600 
4600 
4601 
4600 
4600 



4650 
4650 
4648 
4647 
4647 



.4700 
4700 
4698 
4697 
4697 



a ag a t t t g a c t t cttctttt _ 
aagatttgacttcttctttttI 
Dagatttg aH ttctt|t t t t t 
aagatttgac t.t cttcttttt 
a ag atttgacttcttcttttt 



TGTTT TTTTGATGATTTTTTTAGTCT Majority 

4650 



4630 



4640 



TGTTTTTTTGATGATTTTTTTAGTCT 2603_ai2.seq 
TGTTTTTTTGATGATTTTTTTAGTCT 18rs21_ai£seq 
TGTTTTTTT GAT||A TTT TTTTAGTCT 515_ai2.seq 
™ G J T T T T T T COT G A tBt.T T t T A G t C T- cjblll_al2.seq 
GTTTTTTTGATGA tB TTTTTAGTCT h36b_ai2.seq 



TCACCTC 



ATCTCCTAAATAATGGCTCTT GCTTATGATCTAAG ACTA CTTC Majority 



4660 



4670 



TCACGTCATCTCCT AQa T A AT- G 

tcacgtcatctcctaHata^tg 

TCACGTCATCTCCTAAATAATG 
TCACGTCATCTCCTAAATAATG 
TCACGTCATCTCCTAAATAATG 

TACTGAAATACCCTTAGATCAT 



4680 . 4690 \ 4700 

CCTCTTGCTTATG AT CT A AG AGTACTTC 2603 ai2.seq 



4710 



.4720 



- * k v ^ j. *«itoii-i^iAAOAlirACTTC 2603„ai2.seq 

GCTCTTGCTTATCATCTAAG ACT A C T T C 18re21_aI2.seq 

G CTCTTGCTTATGATCTAAG AGTACTTC 515_ai2.seq 

GCT.CTTGCTTATGATCTAAG AGTACTTC ojbl ll_ai2 seq 

GCTCTTGCTTATCATCTAAG AGTACTTC h36b_jai2.seq 

A A G C A C A G-C T.T T A A C T C T G C TT A T A C A T Majority 



TACTGAAATACCCTTAGATCAT 
T.A C T G A A AT AC C C TT AG AT C AT 
TACTGAAATACCCTTAGATCAT 
TACTGAAATACCCTTAGATCAT 
T A CT GAAATACCCTTAGATCAT 



4730 



4740 



4750 



AAGCACAGCTTT A AC T GH GC TTATACAT 2603_ai2.seq 

A A G C A C A G C T T T A A C T cflc C T T A T A C A T 18rs21^ai2.seq 

AAGCACAGCTTTAACTCTGCTTABaCAT 515_al2.seq 

AAGCACAGCTTTAACTGTCCTTATACAT cjblll_ai2 seq. 

AAGCAC A GCTTTAACTGTG/C TTATACAT h36b_a£2 seq 



C A T C A A A 



C A C T A C 
r 

4760 



CC TTAAGCTTCCTCTGAiTTGACGTCTTTTCATGATAA Majority 
47 ™ 4780 4790 4Rno 



4750 
4750 
4748 
4747 
4747 



CA T C A A A 
C A T C A A A 
C A T C A A A[ 
C A T C A A A 
C A T C A A A 

CTACTGC 



GACTAGCCTTAAGCTTCCt! 



•TGATTG 
T G A T T Gj 



4800 



WW r J Jim " J A fi£ J T C C tU * « A 1 1 G|fl C : G TCTTTTCATGATAA 18rs2JL.al2.sec, 
" A ™C ll A A TTCCTCTGATTGACG T CT TTTCATGATAA 515_ai2.seq 

CTAGCCTTAAGCTTCCTCTG A- T T G A C G tQ T T TTCATGATAA clblll ai2 sea 
CTAGCCTTAA-GCTTCCTCTGATTGAC GTC TTTTC ATGATA A h36b IiS s^ 



G A C T 



CCTCTTTTCATGATAA. 2603„ai2.seq 
CG TCTTTTCATGATAA 18rs21_al2.seq 

n T P • T T T T r- A T C A *r * A etc - t 



G A 



TTTTCATGATAA h36b_al2.seq 



TCCAAGCATAATGCTTAAA C ,C AATAATTGTCAAAAGAATTGTA Majority 



4810 



4820 



4830 



4800 
'4800 
.4798 
4797 
4797. 



4850 
4850 
4848 
4847 
4847 



4840 



CTACTGC 
CT A C T G C 
.C T A C|I G C 

.C T A C T G C 

Bt a cQg C 

CCAAt AC 



J^Mr^IJfH CTTMAp C A AT A ATTGTGA A A. A GA A T T G T A *26Q3_ai2.seq 
I C 9 A A r fi I * tl&n 1 1 * "f AS C A A T A A T T G T G A A A ' A A T T G T A 18^21^1^ 
I S 'n J f V S f I A A I BC T T A A|a C C AA.-T A'A T T G T G AAA AB A ATT G T A. 515 al2.seq 
t r r a i r ? i J J A I ? C ^ T A A A ° C A A T A A T T c T G A A. A AG A A .T T.G T A cJbTll a!2?seq 
• C A . A G C A T A A T G C T T A A A C C A A X A A t T G.T G A A -A A G A ATTGT A • ft36b.Ja72 . seq 

C A C C T C T T T G T G G CA T T G T t A C C T f T T T A T T T T C TAG t C CT T G Maiorl tv 
4860 4870 4880 " 



.CCAATAC 
C C A A T A C 
CCAATAC 
CCAATAC 
CCAATAC 



4890 . 



CACCTGTTTGTGGGATTGTTACCTTTTTATTTTCTAC 
C A C CT G TTT G T G G G A Tt GTTACCTTT T T ATTTTCTAC 
C A C C T G TT T G T G GlJ A Til G XB A C C T T|l T T A T t ITCTI A C 
CACCTCTTTGTC G G A TM G T T A cA TT TT tfi T TTlffl A C 

cacctctttgtggHa t||g tB a cBt t tttHttttct Af 




CGC ATCITTTTTTT-TGCT GCTACC AGCGTAGTCAATGTTACCTG - 
- 4920 4930 4940 




C CCATCT T T T t BW TTGnf G 
CGCATCTT T T tHB T T G C 
G C A T C T T tHBtT T 

GC ATCTTTTT T^jJ 
G Cfl TCTTTTTT 




CGTAGTCAATGTTACCl 
CGTAGTCAATGTTAC< 
T aHt C A A tHtI^ 
T aIIQ CB AT G t|_ 

t a||tc AATGTTACCTG- 




•4900 

2603„ai2.seq 
18rs21_ai2.seq 
GT T G 515_al2.seq " 
C T T G "cjblli_al2.seq 
GTTG h36b_al2.seq 

- A A C Majority 



4950 

.2603_al2.seq 
A C 18rs21_ai2,seq 
A C S15_al2.seq 
" cjblll_al2.seq 
h36b_aI2.seq 
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PCT/US2005/027239 12 



C T T T T CTliC'T'lK'.'T. 



4960 



'CilAHC^; fjTgG.A pj'jjg ipgj|A AA CTTAACATCACCTGCCAAT Majority 

4990 5000 




TC TATG AC 
TGTATC AG 
GTATCA 
T A T G A 

t||t G A 



4970 




TTGATTAAC T»C A A A C T T A All A THACCTCCCAaB 2603_al2.seq 
LTJTG. A T T A A CTlfCA A A CTTA a||'A t||A C C T C C C A Aff 18rs2l_ai2.seq 
T A A CBtDa A AcHHaA CATCACCT cQQ A A T 5I5_al2 seq 
I A C T T C A A aHt t||a CATCACCT ct&JC A AT cjblll__al2.seq 
ACT TIMfl A aH tHTa A CATCACCT G eft A A T h36b_al2.seq 



.TTAGCATAACCTGC 
i 

5010 



TtAGC A| 
T T A G C 



TGGAGCTTGTGTTTCTTCAACGTTGTAAGTACCTTC Majority 
5020 5030 5040 5050 



4997 
4994 



S044 
5044 
5048 
5047 
5044 



ICCTGCTGGAG c|WJ G T G T T T C T TC A ACGTT CTA AG T A C cffll C 2603_al2.seq 
C C T G C T G GAG cfflGT G T TT C T T C A A GGTT G T A ACT A C cQt C 18rs21_ai2.seq 
TT A G CA.TAAC cIJg|1t G g|| G C T T G T G T T T C T T ctgA iWWSlG T a|D|ac CTTC515 ai2 seq 
BtHG CAT A ACCTG C T G g|1g C T T g||g T T T C T T cHABfc IMln T AaTtWI C C T tV 
TTAG CflTAAC CB G CiTGGAGCTTGTGTTTCTTC A A GjflfTTH T A aQBaC c| 



cjblll_ai2.seq 
h36b_al2.seq 



TGCAAGACCTGTAA 



5060 



CTTCA AATTGA.CC-TTCATCGTTTGAAGTCAATT.TAA. Majority 
5070 5080 5090 5100 



TGCAAGACCTGTAACTTCAAATTGACCTTGATC GTTTGAAGTl 
T C CA A GA CCTGTA AX-T TC AAATTGACCTTGATCCTTTG AAGT 

IjGlnAligc cHg t a aHt tcaaattgac c ttBat cg t tWg A A 
HncllAH a c cBg t a aQt t caaattgac cQBttSBc g t t|)g a"" 

TGCAAGAC cHH TAACTTCAAATTGAC cR tIT 



I A MB T A A 2603_at2.seq 
AHflT A A 18rs2i_ai2.seq 
TCA A.TTTA A 515__ai2.seq 
, TCAATTTAA cjblll_ai2.seq 
CGTTTG AAGTC aQ TTTAA h36b_at2.seq 



CAACTTTACCAC C -TTTTTATCTGTTACCCACTC ATAAGCTGTACGAGCT Majority 
5110 ^ 5120 5130 5140 5150 

I 2603_ai2.seq 




5097 
5094 



5138 
5138 
5148 
S147 
5144 



5188 
5188 
5198 
.5197 



TTATCTGTT All C C A CTC AT A A G CTGT A C C A G clfc ; 
. T T A T C T G T T AQ.C CACTCATAA G CTGTACG A G cQ 18rs21_ai2.ieq 
TT T T T A TC THK] A C C C a|Jt cHt aH g BITOMMI A G C T Si5^al2.seq 
T T T T T A TCT gBH A C C C A|| T CAT a|1 g HIMIW A G C T cjbin_ai2. se q 

THT T T A TIKI G T T A C CC AC TC|Ht A A G CTGT AC GAGCT h36b_ai2.seq 



TCAATGAAGCCTGCATTGTAAGCTTCTTCTTTACTTTTCATAGTTGCTTT Majority 
5180 — — T 



5170 

_JL 



5180 



5190 



5200 



TCAATGAAGGCTGCA tH g t A All c tBc ttgtttagttttga 
T -& A A JL£ a a g g c tg catQgtaaHc tHc t t g t t t AGTTT TGA 

| A A ED A A Ji G C BB G A T T G T A A G C T rm T T G T T t !««M tKI A 
_ lA ABE A aHg cHHC ATTG.TAA GCTt|tT G TT T IMB tHB * 

! CH ATQAAGGCTGCATTGTAAG cH TCTT g BH A G T T T T G A 




CTTT 2603_al2.seq . 
CTTT 18rs2 i_ai2.seq 

ClflU cjblll_ai2.seq 
C T T T Ji36b_al2.seq 



TGCTGTTTTACCTTTTTCACCTTTTTGTTCTTCTCCAGACAACTTGTTAT Majority 

5210 5220 . 5230 5240 5250 

I G T| * 



TGC H GT W^lT CCTTTTT B A CCTTTTTG|jTCTB|i G C A ; A C A < rTGTTAl >603_ai2.seq 

T G CHGTIHtHc CTTTTtBaCCTTTTTGHtC tHIT GCAGACAACTTGTTAT 18rs21_ai2.seq 

ITBXTGICC TTllTTG.TT||TT C t|BJ A g||c A aHtDH T THT 515__al2.seq 

1 T T G T TflT T C TBB A Gfl C A aH tBg T T A T cjbill ai2.seo 

UCCTTTTT gBt^Bt C T 7 G 7 CB!g AHAlMlT T G T Tfli h36b_ai2.seq 

A A GCTG CGATAGCTTC AT'C TA A AGCT ATTTTAGT ACTA, G CTAAAGCTCTT Majority 
52 . 60 5270 . 5280 5290 ". 5300 



T G C T G|| T T T A C C| 

T G C T G T T T T A C C T T| 



5194. BGCTGTT TB A C C T T ll 



5238 
5238 
5248 
.5247 
5244 



5285 
.5285 
5298 
5297 
5294 



A A G 0] 

A A G .ej 

A A G C 
A AG C 
A AG C 




ATAGCTTCATCTAAA GCTATTT tWBI TA G| 
( AT. A G C TT C A TCT A.AAGCT AT T T TlBIt A G 
" A G CHB CAT CQa a||g C-tEBSt T T A G tB['G| 

' C/-t;s c a t eyAjBI g cliB-T tta gOTa G 
IcHt cHaHa g gHPt.t t a gHaII 




c C C T A A A G^pJ T T 2603_ai2 .seq 

_G_C T A,A A $BKrt 18rs2CaI2/seq 
I A G cBgH t 5 . seq 

| A GCt CHH cJbUlJ.ai2.seq 
A G C f G t T h36faLal2.seq 



TTTT.G.AGG T G C T TTTGCTT CAT C T ,G C TTTAAG TGC A AC G T.A T T T A C C TG C Majority 
5310 5320 5330 * 5340 5350 




T T T T G A| 
T T T TjG 
T T T 
T T T| 
fflTTT GrA G C T G] 



t gBJ t t t| 



A T C TG.CTTTAA G«G CAAGGTATTTACCTGC 26031^12.seq 

A T C T G C T T T A X GH G C A A G G T A T T T A C C T G C 18rs21^ai2.seq 

CADCT gBtT T AHG T G CQ AHG T ABt t|Jc cDnjc S15_ai2.seq 

C ABc.T GCTTT aBgTG cQ aQg TATT JBc CTHc cjblll_ai2.seq 

TG CA AG GTATTTA C.cHg C h36b_ai2.seq 



TGAGTTT TTA 



A AC AAGC A A TT GTGCACCTGCCAAGCG. TTCACTTT. CAGGTT Majority 
5360 • 5370 ■ 5380 5390 5400 




TGAGTTTT TH ACAAC.GAATTGTGCAC CH GCCAAQcgBt C A 
T G A G T TIT t||a C AACGA A TTGT G C A C cQ G C C A aH C €||t C 

|T TljjT T A A CAA0G A A fflffH c CHCCTGCC A AGCGTT C_ 
IT THT T A A CQAQG A aQQBD G ClI C C T G C C A A G C G T T C A GT| 



tttaacaacga a|!t ctgcacc tQQca a g c g t t cIIg t t t c a g| 




GTT 2603_al2.seq 
T T 18rs21_al2.seq 
T T 515L_aT2.seq 

cjblll_al2.seq 
h36b_ai2.seq 
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Weighted residue weight table. 



c a t t t gp£?j'G^ mgm 

5410 .. 5420 



'CT/US2005/027239 a 

yjjgg nCACAACTTTTCGTTCACTT Majority 
5430 5440 5450 



5379 
5379 
5395 
5394 



5429 
5429 
5445 
5444 
5444 



cat tBg TTTTGACAAATTTCTTACCATGAG 

C,A T tQg t t t t g a c a a a t t tcttac c a t g a g 

ATTTGTTT T|j A C A A A T T T lMWHU C C 

atttgttt t|1 a c a a a t t tQBDH c c 

AB TTGTTTTGA CH A ATT T||T T A C C 



: a tQ a g f 

- A tIIa G T 
?B T G|I G T 



[CAACTTTTCGTTC AGTT 2603_ai2 .seq 
CAACTTTTGCTTCACTT I8rs21_.ai2.seq 
C A CBA CTTTTGGTT C||gQt 5I5_al2.seq 
C A Cfl A CTTTTGGTT cHgHt cjblll_ai2.seq 
TCACAACTTTTGGTTCAGTT h36b_aI2.seq 



GGGTTGATTC.CTCTTGGCTTATCTGAATCTTTGCTATTGGTGATGCTTAC Majority 
! 5460 5470 5480 5490 



G G G T THj ART GGTGTTGGGT 
GGGT tfl AH TGGTGTTGGGT 
GGGTTGATTG CH.C T T G gW tI 
G G G T TCATtG cfl G T T G GQ t| 

g gQt tD aDt.g gtgt-tgggt 



ATTACC ATTTTC AAGAGTTAT - 




5500 



55 lO 



5520 



GAATCTTTGG T A T T G G tRa TGGTTAC 2603„al2.seq 
G A A TCTTTGGTAT T G G TEA TGGTTAC 18rs2l_ai2.seq 
G A M T _ C . T T T G D T B T T D| T GATGGTTAC 515_at2.seq 
G aBTDT TTGIItBt TBHT G ATWG_THA,C cJbilLai2.seq 
G A A T C T^BQ G TATTGGTGA tH gHtIH C h3Gb„al2.seq 

TGC ACTA CCG TA ACCACTAAC ACCTT Majority 
5530 SS40 , 



5479 WTTACCATTTTCi 

5479 BtTA C C A T T TT c| 

5495 ARMH C C A 

5494 ABBA C C A T 

5494 aQt A C C A| 

CTGAGATCATGT 




5540 * 5550 

G T A A C C A GH AACACGTT 2603_ai 2 . seq 

G T A A C C AgH AACACGTT 18rs21_aI2-seq 

G T Aljc cWnTF AACACGTT 515_al2.seq 

TAIBc cHHt A A C A C GT T cjblll„ai2.seq 

T A A C C A GDB A|I A CGTT h36b_al2.seq 



5526 
5526 
5545 
5544 
5541 



5573 
5573 
S595 
5594 
5591 



5560 



ATGTTTTATTAT C ATCCAGACC AGTC AATTTACCTGCG Majority 
5570 . 5580 . 5590 5600 




T C C A 
T C C A___ 

CAGACCA 



5570 

CT.GAG ATC AT G T A T GKJ T T T 
C T CA G A TCAT G T A T G_9 T T T 

c D0DD A D (9 c T A D g t t t t 

cBBPfl a QBBI g t aH g t T T T a 

c t g a g a t c a t g t a t g t t t t a 

aagttaccagatactgtaaat 

: — r~z 1 

56 lO 5620 . 5630 

A A G T T A C C A G A T A C tQQ AaItTTGATACCA 
A A G T T A CCA G A T_A C TI M A A A T T T G A T A C C A 
, « ^ «f , mmmm m^mm^ T G TiWWJ t t tHMBWWJ a 

„-_. T G TUtU T Tfl^nSQBH A 

ACTGTAAAT TTBa'T A CCA TTAC.C 



5580. 5590 

« i 

AGACCAGTGAAT-TTA.ee- 
A fiACC A G T G A A T T T_A ** 

a aQQt 

A C C|| G T G A aHB T 




2603_ai2.seq 
18rs21_jai2.seq 
515_ai2.seq 
CjbUl_al2.seq 
h36b„al2.seq 



T G A ATT TAC C| 

T T G A T A C C A t T-A C C A A T G'T C C A T T G T A C C Majority 

ccon - 5640 5650 
I : i_ 

GTCGATTGf A CC 2603_al2;seq 
* * C G _ A j_ T G T A C C 18rs2i_ai2.seq 
HB T " T BHI C 5 1 5_a 12 . seq 
IBBt G T A C C cjblll_.al2.seq 
GAT tDDc C h36b__ai2.seq 




A G G T - - 



5660 
__i 




-GTTT TTGTCAATGATACTGAAGCAACAGCTGTATCAT Majority 
5670 5680 5690 5700 



C A A T GAT J 

c a aQgIH j 

C A aB gDQ / 

C A A T G A Hi 



5667 
5667 
$689 
5691 



5717 
,5717 

5718 
•5717 
.5738 



- GTTTTTG TC'AATG A T ACTGAAGC AACA G|1t|1T A T cQ T 2603_ai2 seq 
G T JLJLT. T lP_ m° A A i G A T A c T G A AGCAAC A^gJBI T0 T A T CQT 18rs2i._ai2.seq 
tBHtBH C A AH GOB *BHM| A G C aB,C aBTc T GjJBjtC A T 515_ai2.seq 

AIKKIIa.G CBDDaB C T gB^B C A T cjbl.ll_ai2.seq 
I A C T G AB GCAACAGCT G t iW C Afl h36b_a!2.seq 

TA TCTTT A AAIC TGTAA AC A ACGTTT A CATTA TC T'G GTTC.ACTACCTTCT Majority 

5710 5720 5730 5740 \ 5750 

[ "2603_jal2.seq 
| 18rs2l_ai2.seq . 
| C f 515Lai2-seq 



TATCTTT 
t At C T t t 
ODD C TTT 

f aBcDtBa 

D A t C t T TA 




cjblll_al2.seq 

h3(Bb^.aI2.se^ 



A t; a A c 

G C C C AA g T TTT A G T A A CTGTTAT TT C A C C C T T T G A T G .6 T C f A ACT G G T A G Majority 



5760 



5770 



GgC CAAGTTTTA g||a AC TGTT AQ T T 
GDC CAAGtTT TA gB a ACTGT TA||t T 
G C CC AilGT T T TffllTA A cHg TH A T T T| 
G C € C ABB T X T >H TB A elm Til A T T T 
HCCCAAGTtT TOST A A cflc tB A T T T 




5790 
J. 



5800 



G GtGTAA Cj 
C GTGTAA Cj 
G gHO t ABc T 

g gHgtaBc t 

GG TG tI|A C T 




2603_ai2 < seq . 

i8rs21_al2,seq 

515_al2,seq 

cjbm_al2_seq 

h36b„al2.seq 



TTCTTTCATTTCTTTACCTCGTTTGTTACCATAGTCCAATTTGATATCAf Majority 
S810 5820 5830 5840 5850 




TCTTTACCTG GTTTGTT ACC ATA 
T C TT T ACC TGGTTTGTTACCATA 



r t 






; a 



CA ATTTGAT ATCAT 2603_al2.seq 
CAATT t^£ AT A T C A T 18rs21_aI2.seq 
*** "atCAT 515_al2.seq 



tms&msmm * - «uT tHt TACCATAGTCCAATT tHaQaT CAT cJbUl_ai2.seq 
TTTACCTGGTTTCTTACCBt a||tHHaBt TTGATAT.CAT h36b_aI2.seq 
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£g*l TOS CCGTT AACAGTACCTCTATAA Majority 
5870 5880 5890 5900 



5860 



5817 
5817 
5818 
5817 

5838 



5867 
5867 
5868 
5867 



T g gH t T C T G G A T tW T C A All A a tB o 
T G GOT T C TGG A TTH 't C A aHa A _t|| G 
T G G T T T C TUG A|Bt t C A AcBaBaG 

T G G T T T C T G G JMt T T C A A C aDFt A G 

t gBtttc t g gBTt t|Jt c a|Ic A A T A G 




C CWT TAACAGTAGC 
C cH TTAACAGTAGC 
C C G T T|| AHA G T A G C T 
CCGT tQa CRG.f A G C T 
ACCGTTAA CD G tK G c| 



C T A T A A 2603_ai2.seq 
C T A T A A 18rs21_ai2.seq 
***** T A A 5I5_al2.seq 

T A A cJbUI_ai2.seq 
TAT AH h36b_a!2.seq 



GTCAATCTGAATTCAACAT CTTTTGTTTTT GC TGCTTTTTCCACTTTTCC Majority 
5910 5920 5930 5940 



GTCA ATG TB,A A T T C A All A T < WMU T G T T T Til G CTGCTTTTTCCaI 

G A _ LP. tH aa t t c a a "*a t cIHHt gttttHgctcctttttcca 
g tDaihIt g a||t tcaacatctt t tgt ttttgcHgcttt tBc C A 
g tIH aOBjt g aBttc.aacatc t t TBni t t t t g cBgc tIBD^IH cD a 

GTeAATGTGAATTCAACAHcT tHtG T T tI8W G C t|I C T T T T TliH A 




2603_ai2.seq 
18rs21_al2.seq 
Sl5_al2.seq 
- Gjblll_al2:seq 
G|| h36fa_al2.seq 



CAAACCTTTAG CTGTCAATTTTAATGTGAAACCACGG 
5960 5970 5980 




T C A T C A G T T C C A A 
, r- 

5990 



Majority 



AGCTGTGAAT T T T A A 
£ CT G T G A A TTTT A A 
c Pg tR A AQJQR1 Af 

g cm tH a aBHSB a I 

T AU CTGTG A ATTTHA a| 



TGT G A A A C C A C G GfflC A T C A 
T G T G A A. A C C AC G GQC A T C a| 
G A A A C cIMg G T C A T 
G A A A C C US GGTC.ATCl 
G T G A A A C.C ACGGTCATC 




A- 2603_al2.seq 
A A 18rs2l_ai2.seq 
A A 51S„ai2.seq 
Cffl A' cj bl 1 1_ai2 seq 
C A A h36b_al2.seq 



GTTTC TAGTCTGTATCCTTAACAAAAGTTC CTG 
1 ~~ ~ ~ ~ 1 

6030 



C CGATGCTTCA Majority 



6010 



5967 
-5967 
5968 
5967 



6017 
6017 
6012 
6011 
6032 



GT TlWlT A Ore TGTATC 
G T TuBlT AGTCTGTATC 
GTTTG.T AH tA TGT aW c| 
G T T T G T A G t|| tBH aRTC 

I ttgtaHtgtgtaHc| 




6Q50 

G aR g C T T|| A 2603_jai2.seq 
GAllG^TTilA 18rs21_ai2,seq 
A T Gil TBC A 515_al2.seq 

TQcD cjblll_ai2.seq 
h36b_ai2.seq 



AGGTTAACACTGTTA 

.6060. 




CC CATTCTC A A A C C ATTTGAC ATG.CTATCTGT.C C A Majority 
. 6070 6080 " 6090 6100 



CCCATTGTCAAACCAT. T.TGACAT, 
ACGC AT T GTCAAACC ATTTGAC AT 



GTC AAACCl 
GTCAAACC! 



Bt tQ g|| 
tth g|| 



Hi 



(CATGCTATC 
CATGCTATC 



A A C A G T C T T A C C C A T T G T C A A A C C A T T T G A C A T cQt A T C 



TATCT-GTCCA 2603_jal2.seq 
T A T CjT G T CCA 18rs2i_ai2.seq 
GTCCA 51S_ai2.seq 
C T CCA Cjblll_ai2.seq 
CCA h36b_ai2.seq 



AACCAAGTTTTTGTATT TAGAACCTTTGTGAATTTTTGTTTTA AC TTCGT Majority 
6110 6120 6130 6140 



6067 
6067 



6058 




AACCAAGTTTTI 
A A C C A. A Jr T TT, ' 
D ADD A j 
A A C C .a| 

aDcDaI 



6150 

G f A T T T A G A A C C TTT G T G A A TTTTT GTTTT A A CT t CBt 2603_ai2.seq 
& T ATTTAG A A C C T T TGTGAAT TTTTGTTTTAACT tJIt 18rs21„al2.seq 
T ABtHI.G A A C C T T tHR GB A T T T T T G T tBWa A cB tBTgTt 515_al2.seq 

C C T T T||H G A ATTTTTGT 1 |BH A A Cl MB G T ejbill_al2.seq 
C C T T TflT CB ATT T T T G T TT'TAA CTtC G T h36b_ai2_seq 



T AQ tDU g A| 

G T A T T T A G 



AAGGT ATAACTTTACCGATTTCAGCAGTAGCAGTTGCTTTGTCTTTTTGA 
61 60. 6170 6180 6190 620 



Majority 




ACCG.AT.T T C A G C A G T A G C ASG T T G C T.T T G T Cl 
A C CG A ^rjrTCA G d A',G ,TA-G. C A G f t G. C T T.*F:G t G| 

G CM ADEBQQ G cMj.;p MBW .C A C T 11 C X T T G-T C T t>t t| 

a c cB aHimIm g c II fG^BWa c/a MIMPH c t t t g t c t t jr v 

ArXX C G. A T. T" TTC A • G C Gt*t# & cWM Wr W$ H T T t|| t c -j 



2603^ai2 .seq 
18rs2jLMi2^seq 
5l5,jaI2.s«i- .■ 

Haeb-ai^-seq *• 



• A AMUG i T|flA C CG ATTTC A G C ACTA 0 CDl T T G C T T TH T C tHBHIt G A .h3Gb_a!2.s 

T A G T T A G CAT A A T C f G C G C C A G CT' GTCAA A- A C T C T A T T A A C A ^ C t.G T C A A Majority 
6210 * ffr?n - c9on — " 




TA AT 
T A AT 
T A AT 
T A A T C — 

t a|8t C T 



6220 . 6230 6240 

T G C G C C A G C T G_T C A.A A A G T C T A T T A A C 
" CCCACCT C.I C A AAA GTCTATT A A C 




6250 

C T GTC A A 
T O TC A A 




2603_ai2.seq 
18rs21_lai2.seq 
515^jai2.seq 
cjblll^ai2.seq 
h36b_al2-seq 




TGTC A A CTTCTT TC TTTT -A GCA A AG TTTTT A TC TATTTGTCG TTTTT Majority 

6290 * 6300 •" 

±- 



6260 



6270 



6280 



T GTCA AH iD G T T TGTTTTAGCA A A G T T TT T AT cIIa T T TflT G G TTTTT 2603_al2 seq 
TGT_CAAnTDGT T TG TTT TAG C A A A G T T T. T T A T c|J A T T tH T G.G f T T T T 18rs21_ai2.seq 
A A GTTjjT T TljT|aT»G CfflA A G T.tH T T A T C T Ajj T TGTGGTTTTB 5I5_ai2.seq 

a a GTa ?D t t tEVtJDtHI g c a a a g t t||t t a t c t aHt t g t g g t t ttP 
c a aBttBtt t gBttt a gcaaactttt tBt cBa tttgtggtI 



cJblU_al2.seq 
T T h36b_a!2,seq 
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per; 

6310 



"" '6320 '"" " " 
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|TCTTCAACAACACCATCTTCC Majority^ 
6330 6340 6350 



6267 
6267 
6214 
6213 
6279 



6317 
6317 
6264 



CTTC ACTCTTCTT TGGATAAA cWt 
CTTCAGTCTTCtTTGGATAAA CQT _ 

TTCAG TC TTCTTTGGATAAACGTGAGC| 



T T C A G 
C T T CH G t| 



[TTCTTTGCATAAAl 
TTCTTTGCAT aB A ) 



TGAGCATCT T| 
ItH A G CQT C| 




(iCAACAACACCATCTTC 
CAACAACACCATCTTC 



AACAACACC AT 
A^ A C A A C A C c||t 



_ en _ 

A C A ADD C C|IT CTTCG h3Gb__ai2; S eq 



G 

—r- 

535( 

6 



2603_al2.seq 
18r*»2i_al2.seq 



TCG 515_ai2.seq 
TflC cjblll_ai2.seq 



TTTACCAATCG AAGAGTGATTTTAACTGGAACTGCTTTTCAATCAGCCAG Majority 

6370 - 



6380 

1 : 

TTAACTGGAAC 
TTAACTGGAAcj 



6390 

GCTTTTGAA 
GCTTTTGAAi 



6400 

C A G C C A G 2603_ai2.seq 



TTTACCAATGGAAGAGTGA T 

TTTACCAATGGAAGAGTGA THT TAACTGGAA CQ G C T T T T G A aQc A G C C A G 18rs21^ai2 ^ea 
TTTACGAATGGAAGAGTGATTTTAACTGGAACTGCTTTTGAATCAGCCAr 
T T T A C C A ATGGDAG AGTG ATTTT A ACTGG A ACTGCTTTTG A AT C A G C C A 1 

J ^fKV /* * * BK« r-. r» * * i -r. ,» j *»v ^.M .. — . . M „ „mm - - _. _. _ . mm •*■« 



515_al2.seq 
cjblll_ai2_seq 



C A.AllG GAAGAGTGAT T T tQ A CTG gQ A cQg C T TT T G aQt C A G c!|A G h36b_ai2.seq 

CATGG AACCATTATTGTTGTAAGTAGATTTTTCT T'T CAATTCAACGATTT Majority 
641Q RA-yn c>on „- 



6420 



6367 


G AI| 


6367 


G AC 


6314 


GAT 


6313 


GAT 


6379 


Bad 



GAACCATTATTGTTGTAAGTAGATTTT 
G G A A C G A T T A T TGTTG. TAAG.TAGATTTT 
I.CAACC ' ~ ~ - - - - "~ 




6440 
i 

TTCAAC 
TTCAAC 



A T T T 2603_al2.seq 
A T T T 18rs21_at2.seq 



: A TffljT T G T T G T aBgtAG ATTTTTCTTTC aTt TCAACGAT T T 515^12?^ 
nTTigT TgTllG.T A|!1|tIIg A T T T T T C TTT C A A T T C A A C G A T T T cjblll al2.seq 
' * BBS AT TBTTHt A A||- * * ^ ~ ~aw~ " ■ " - wm ~- ■ 



T A G A T T TH T CIBH C A A|S T CAACGATTT h36blai2.seq 



GGTACTTTCCTfTTAATTTTCTAGTCT T. A AAAACAA-- 



-TACCTTTG Majority 



6460 



6470 



6480 




C C c T T T| 

... C C T T T| 

TT C C THT T A AJTT Tl 
TTCC TT T T A i 
TTCCTTTTA 




.6490 



GTGT T 
G T G T T 



TCTGTTCTCAATCC - ■ 
6510 



6520 



A A A A C A A 
A A A A C A A 
AGTGTTAA A DUMB A 
AGTG T T A A A A A C A A - - 
A G TH T T A A aHH A A - - 



6530 



t; i r 




t; r r 


IIlTAfl 




2603_al2.seq 

18rs21_ai2,seq 

5l5_ai2.seq 

cjblll_al2.seq 

h36b_al2.seq 



T T T A A G MaJ or i ty 



6540 



655*0 




TGTCAATCC 
TGTCAATCC 

HA A]"" 
A A 

gttgtcaa! 



A lr A ( \ ( 


i. t..t I 


1 I A 1 


r a a • 


A 1 


1 \ t 


i i; i 


1 A | 


A V, A C A I 


" (. 1. [ t 


I I' A 1 


I" A A 


' A 1 


i a r 


I ti i 


I A 1 



T T 
T'T 



G 2603_al2.seq. 

G 18rs21_ai2.seq 

TTT. A A G 515_al2.seq 

tQDa A G cjblll_al2.seq 

TTTAAG h36blaI2.seq 



AACACCATCTT.CATTAATTTCTGTTGGTTTTAATTTGT TA -CTTCCTTAC Majority 
R*firt cw" ' 6580 ~™ 



6517 
6517 
6428 
6427 



AACA- 
AACAL 

Baca c < 



6560 

A C C A T C T Tl 
AJ^C A T C T t| 
C A 




6590 



| T T A A T T T G T T A - C| 
T T A A TTT GTTA-C 





2603_ai2.seq 
I8rs21_ai2.seq 

«.A C 515_ai2.seq 

C T T CCT T A C GjblIl_aI2-seq 

t A - C T T cHHt A C h36b_ai2.seq 



CACTTTCCGTAATGTATTTTG T TT T A T C T T 

~ i — ' ; 1 ; : — ■ — n — : 



6620 



6630 




C A T T C AT A A C 
. 6640 



A I I' A t 
\ ( { A I 



TT.T T A T C T T. Cj 
l&nyyW T C T T Cj 



wq tJB a t T t ? & - V 
l^.lffl^A.iifT A T T T T^G 



A A A C A A* A G C A C CT TCGATTTCTT T A G A f r. - 



6jB16 
6616 
6521 
6520 
6571 



6663 
.•6663 
6571 
6570 
6618 



6660 6670 

n ^-t; : L-l 

A A A G A AO GC.ACCTTCGATTTCTTTAGAT 
A A A G A AUG C A C G'T f X GATTTCTTTAGAT 

aaa g||a agcag cjrijH£G at t t<: t t TBcDLll 

AAAGAA. AG C A C cWflDI GATTTCTTT Acf^ 
A-AAGAA AG C.A C CBJCG ATTTCTTTAGA 




[ikT-T C A-T Tl 

T T c a<t bsl 

; T TG AT TC 




A G C Majority 

. 6650 
i— 

A G C 2603"jal2.seq. 
ACC 18r^2Iv_ai2,seq 
-■-•A G G al51.ai2.seq 

aH[ C cj bl 1 l_ai2 -seq 
G p 'h36b_al2.seq *- . 



C CTTCG CC AAA CTA.AGC T G Majority 
*' 6700 



6690 
I 

TCGCC A A A G T A A C C 
T C G CC A A AG T* A AC C 
G C C A A .A..G T A 



ICC .A A A 
C C A A A[ 



T A A] 

t.a| 




2603_ai2.seq 

18rs21_al2.seq 

515_iai2.seq 

cjb^lijai2.4jeq : 

H36b_al2,seq 



TAAGGTCAGTAAT T^T CTTT A C*C -TATGTAGTCTTTTCCATTCAT ACCTTTT Majority 
6710 6720 6730 6740 6750 



AAGGTCAG 

A A G G T C A Cj 
T A A G G T C BH BB ATTTGTTTACCj 
T A A G G T C Afl T A ATTTGTTTACC 

BJaaIIctcagta aH t t|I t tH A C C 





TGTAGTCTTTXCCj 
TGT AGT C TT T T C Cj 
G T A G T C| 
T Ap 
T *A 



2603_aI2.seq 
18rs21_al2.seq 
TTT 5I5_al2.seq 
TACCTTTT cJbllUal2.seq 
C CH TTT h36b_al2.seq 
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G'T T C C jgjgjQj Ajj A.G jg gflgp E l fcEig AC-CfTCTCTTTCCACA ATCTT Ma.orltO 

f3f2 CT . 70 6780 6730 6800 

CTW G^CC^r?A^?^^ TT;<iT || A A H * T ^^B 1 TDB C T T T cfiAiA AICIT 2603 al2.sea 

GTGCAACCTCA CTGTAT TAGT T C T - - - TCCTTCATCCCCAAACC Majority 

68 . 10 gSgO 6830 68 40 6850 

6763 "SclllS?cic?2?i?Ti??-- J S i ~ - - - TG C TTCAXCCGCA A ACG 2603^12.^ 
«wt>j i» 1 u -l. A Afa GTCAC-TGTATTAG T - - - T G T - - TGCTTCATCrrrAdArr ift^oi al , _ 

6668 GTGCAAGGTCACTG tQ tH A G tQ3Q t|] T T C C T T cHl C C € C A ! A H K 

6667 G T G C A A G G T C A CDG T A tHa G T^~t" T - T G cIHt cIlCC G C AAA fcT Mt, 1? 

6718 OTcDaa-CCT||aCTCtHt?ACT- - -DO TEOlIinaiT C-S CBG C a AAmK:^ 

CTGCTCCAACTCAGAACAATG ACGTTAAAGTCAGTAACAATCCCCAGAAC Hatoritv 
68 t 60 6870 6880 6890 6900 

f!°J ffTFr^TFTtTSll A fi C A A TGACGTTAAAGTCAGTAACAATCCCGAGAAC 2603 al2.se*, 

G A, f^ A A I « A £ G T T A nH G TCAGTAACAATCCCGAGAAC ISrsiLal^L, 

C W G C W G C M * ■£ w A A « A B T 1 G A C G T T AABHtc ART aBc aHt CD C G A G A A C clbUl air seq 
6765 CDC CBC CBA C|I C A&A A C aTHg A|lGTTA aTHtBaBt A AC A AT G C C G A G A AC Web"^.^ 

ATTGCAAAATATTTGTTCAT TCTTTTCATTTCTATCTCCTTCTTATTTTA Balnrlfv 
• ' 69 . 10 69 . 20 6930 6940 " 6950 

6854 ATTGCAAAATATTTCTTGATTCTTTTCATTTCTATCTCCTtCTTATTTTA 2603 al2 sea 
68S4. AT T G C A A A A" T A T T T G T T ,G A T. T C T T T T C A T T T C T A T C T CC T T CT T A T T-T T A IS^J? 
6762 aQ T CCA A A AT ATTTGTTGAT tBJ TTTTCATTTCTATCTCCTTCTTATTTTA 515^12 seq 
6758 A TT CBA ABABA T T T G T T.G A T t|Jt TT T CATTTCTATCTCCTTC T T A T tHt A cjbll alTsea 
681SATTCCAAAATATTTGTTG.ATTCTTTTCATTTCTATCTCCTTCTTATTTTA. ^al^se^T 

GTT * ATCAACATGATTAAT AATATG C G G A TTT T A AT A -C-ACCCCAGCAC Majority 
. 69 . 60 6970 6980 6990 7000 

G TTAA.TCAACATGATTAATAATATGCGGATTTTAATA-cl ACCCC ACCAC 2603jl2.se, 
^°i G II AA I CAACA 2; GATTAATAATA T«CGGATTTTAATA-C-ACCGCAGCAC ISre^Ia 2^eq 

° 11 1 A J C A A C A T MM^ T A A T A A T A T G C GGATTTTAATA-C, AC C C C A C C A C 515 a I.seq 
6808 GTTAATCAACA T BKJJ T A A I A A I A tBc CC A TTBl A A t A OH - A C C G C A G C A C clblH al2sea 
6865 C T T A A T C A AC A t|a|t A A T A A T A t||c G G A T TQ T A A T A^Ja C C G C A G C A C mSU^^ 

CACTCCCTTCAACTCATGCAATTTTAG TTAATTAATTAAiSAATACTAAAG Majority 
, ™^ .7020 . 7030 7040 ~7050 

™S ^l^^^I CAAGTCATGGAATITTACTTAATrAiTT """A<:TAAAG 2603 al2. S eq 

CAC I CCGTTCAAGTCATGG AATTTTAGTTAATTAATTAAGAATACTAAAC 18rsil_al 2 1L 
CAC ^ CC ^n C AA£ii&Ai G | G AAT ^ TA ^ TTAATTAATTAA CAATACTAAAG 515 Ztf.^ T 
e1?4 C A C ? C cBt T C ^IRFK^JL^ILJ. A A T T A A T T A A G A A T A C T AA AH cJbTl^aia^ea 
6914 C AC TC CUT TC A AEKW A tMHHBBBBHBI aB T T A A T T A A||A A iflc T A A A C H36b_al2.seq 

CCCATAATTTTTAATCTTTT TTCATGGACATATCACTAGAT TT CTIATAC Majority 

. . 70 . 60 7070 7080 7090 7100 

™S ff^Ti A t T T l T *■* A l C t T T " T C * T C G A c A T A t C A C T A C A T T T C T T A T A C 2603_al2.seq 

J2£ ^ ? Jwi I Bft? J * * " 1 1 T t "«' A ^« *£-A T A T C ACT A G A T T T C T TAT AC SlS.alllse^^ 
1 ^ITm^IHv*! M C T T T TTMBnTnGAniTA T C A C T A G A t T T C t T A T Afl cJblUji^soq 

C T T T T C C A A A TAT A A A T T C C.A C C T C C A A T A G A C A I.CA .T AC C T C C A C C T A T Majority 

- " ' - . 71 . 10 • 7130 7140 - 7150 

™^ C nn CCAAATATAAATTCCACCTGC A A T ACACATCATAGCTCCACC TAT 2603_.al2.seo 

5 1 1 T T S C A A A T A T A.A A T T C C A C C T G C A A T A G A C A T C A T A G C T C C A C C T A T 18r*2lJlHeq 
6960-CTTTTCCAAATATAAATTCC ACCTGCAATAGACATCATAGCTCCACCtXt 515 a 2?sW 
|949 PTTTTCCAAATATAAATTCCACPTGCAATAGACATCATAgHtCCACCTAT cjbm alTseq 

7002 cttttccaaatataaattccacctgcaatagacatcatagctccacctat laebjr^se^. 

TAAAATGAAAGATAGAATTC CTTTCCCACCTCTCATAGGAATAATTCCTT Majority. 

7t . 60 7170 7M) • 7190. 7200 

7J02 T A A A A T G.A AAGATAGAATTCCTTTCCCACCTGTCATAGGAATAATTCCTT 2603_al2. S eq 

I AA f A I GAAAGATACAATTCCTTT CCCACCTGTCATAGGAATAATTCCTT 18 ra 2U.al2!seq 
It™ t» AAA t ^ G ^ TA CCAATAATTCCTT 515_al2.seq 

Sfc! TAAAA TGAAA.CATAGAATTCCTTTCCCACCTCTCATnCGAATAATTCCTT cjblll al^seq 
7052 T A A A A T G A A A G A T A G A A T T C C T T T C C C A C C T C T C A Tfl C G A A T A A T T C C T T M6bua72 seq 
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TTG G 



t prf^f^fjf^ c- ffiy*»T* tgcttgt c accttcctcatga 

72 t 10 7220 '" 7230 7240 7250 



■ 1 , ' , 

7060 ^CG?rr^A?^r^^J TGCTAATTAAATCCTTCTCACCTTCCTCA TGA ISrsil al_1L 
70S ??cc^rrMT A ^^i^ TTCCTAATTAAATGCTTCTCACCTTCCT CATG ASIS ali.s^ 
7049 TTGGTGGAATATGCGTGTTGGTAATTAAATGCTTGTCACCTTCrTr»-rr « , lt ,7. . To 3 

7t02 t.tggtggaatatccgtgttggtaattaaatgc"g^cacc??cc?catga ^"ll'seT 

TATTCAGAAATC TGT TTATTAACACCTATTATATTTTTTATCGATCCTTT Majority 
J270 7280 7290 7^00 

7202 T * T TCAGAAATCTGTTTATTAACACCTATTATATTTTTT ATCGATrrTTT ?fim 

7202 TATICACAAAICIGTITAITAACAOCTATTA ilfll J c " "J ^^l\TL. 

7110 TATTCAGAAATCTCTTIATTAACACCTA.TTATATTTTTTATCGATCCTTT If^Jf™^ 

7099 TATTCAGAAATCTGTTTATTAACAGCTATTATATTT tHt A TCGATCCT^ 3m??.!? 

7,52 TATTCACAAATCTGTT T-A TTAACAGCT'ATTATATTT T_J TATCCATCC T TT ^"alll^T 

AAC-CACTTCAAAACTTAAAATTG G TTT'ATTAGTAATTTTTTGATAATCGT Majority 
73 . 10 732a 7330 7340 7^0 

7l2 _A C CA^TT_A^AArT?M A h^ 

7160 A * c J_ AGTTCAAAAGTTAAAATT-GGTT. TATTAGTAATTTTTTGATAATrrx <:i«c . 
7149 AAcBactTCAAAAGTIAAAAIICUTIAITA I AtTITIHAI T cJ ^m' 

7202 aa cHacttcaaaagttaaaattggtttattacta A T t^ 

CCCGCGAAACTCCTTCTATTAA C TCATATTTCCCATCTTTCAAATCTTTG Majority 
7360 7370 7380 7390 7400 

™l CCGGCGAAACTGCTTCTATTAACTGATATTTGCCATCTTTCAAATCTTTG 2603 a!2 ^ 
7302 CCGGCGAAACTGCTTCTATTAACT-GATATTTGCCATCTTTCAAATCTTTG 1IS2T 
7210 C q C G C G A A A C T G C T T C T A T T A A C T G A T A T T T G C C A T C T T T C A A A T C T 1 T G S_T_ ? ^ ^ 
7.99 Dec GC G.A AACIGCTTCTAItAACIGATATITGCCAICTTTCAAAICITtC SI ^ 
7252 HcGGCGA'AACTGCTTCTATTAACTGATATTTGCCA-TCTT. TCA-AATCTTTG '^ebilaT^seq^ 
TAA CA'AATTTT C' C C G T T T TCTCCC GTCA C T ACTTTTGAATTATTATTTTT Majority 

74 . 10 7420, 7430 7440 7450 

75,52 " rAAC *.*ATTTTGC.CGTTTT_TCCCGTCACTA C T TTTGAATTATTATTTTT 2fl» „.» ™ 
7352 TAAGAAAT T'T T-G CCGTTTTCTCCCGTCACTACTTTTGA'ATTATTATTT TT tSST I 
7260 TA 'A GAAATTTTGCGCTT T. TCTCCCGTCACT ACTTTTGAATTATTATTTTT iSt _f±^ 
7249 TAACAAATTTTGCCC.TTT TCTCCC GTCACtACITtTGAATTATTATTITT f hit, ^ M 
7302 TAAGAAATTITCCCCrTTT.CTCCCG T C A C T ACTTTTG AATTATXATTTTT tX^X 

TATTCCTAAATAAAGTTTATAATCT TCATTAAATTCTTCAACTTCAAACC Majority 
. 74 *° 7470 . 7480 7490 " 7500 

74o2 ? A ^^? AAA ? AAA ^ir A j AATCTTCAT TAAA TTC TTGAAGTTCAAACG 2603_al2.se, 

73W ?A?Trr?t AA J AAA ^n A I AATCTTCATTAAATTCTTGAAGTTCAAA CG ISrs-La^.seq 

K99 tatJgCT A A A? _ _ A I AATCTTCATTAAA TTCTTGAAGTTCAAACG 5lS_al_. Seq ^ 

7299 T A T T G G T A A A T A A A C.T T T A T A A T C T T C A T T A A A T T C T T G A A G T T C A A Arc ,-ihTi i 

7352 TATTCGTAAAXAAAGTTTATAATCTTCATT^ 2S^LT 

TAGCTCCTTTCACAAGC A A CT TATTATTATCTT T.A- T C A A C T T T T A T A A A T Majority 

7452 
7452 
7360 

) T C — A * T T ° A 9 C T A A C TT C TTCT CGTTTTTAATCGT T.A TTGT A-G G A T A T T C l^fority 
7 y° • 7570 „ " ■ 7580 • 7 590 ■ 7600 

7502 TCAAT T.T..C A C. C T A AX TTC T T -C TCGTTTTTA ATCCTT A-T TGTAGGATATTr ift^i 

■££ TC^TTCACCTA^?^ 

ItAATTTCACCTAACTTCTTCTCCTTTTTAATCCTTA t *r r t a r i «r a t~ -r» r> T„ , ,~* 

7452 TCAATTTCACCTAACTTCT^ 

T C T C A C AT CACGAATTTTAGGGATT G C A- A AATCTCTA ACTGUTT ACC AT Majority 

76 . 1Q 76 2 0 7G3Q " 7640 ' 7650 ' 

IfS iSI^ ACATCACGAATTTTA ^ CGAT TGGAAAATCTCTAA.GTGTATTAGGAT 2603 a!2 sea 

74^ IpIp^ A I GAGG M TTTTAGCCATTGGAAAAT ai2.seq 
™^ TCTCACATCACGAATTTTAG.CGATTGCAAAATCTCTAACTCTATTAGGAT cibTll ai2 sea 
7502 TCTCACATCACGAATTTTAGGGATTGGAAAATCTCTAAGTGTAtBIggB MGbaT^r 
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CCTC 



7660 



7670 



7680 



ATTACTGTCATAGAATTTG Majority 



7602 
7602 
75 10 
7499 
7552 



7652 
7652 
7560 
7549 
7602 



. C C T CTGACTTACGATTCAATGTTGTTCTACCATTAGTG 
CCTCTGACTTACGATTCAATGTTGTTCTACCATTAGTG 
C C T C T G AC TTAGGATTCAATGTTGTTCTACCATTAGTC 
C C T C T G AjT TAGGATTCAATGTTGTTCTACCATTAGTG 
C C T C T G Afl TTAGGATTCAATGTTGTTCTACCATTAGTG 



7690 



7700 



T T 



ACTTA TAAAACTGTCATCTAGTTTCACATCATATCT 



7710 



7720 



TCATAGAATTTG 2603_al2.seq 
TCATAGAATTTG 18rs2l_al2.seq 
TCATAGAATTTG 515_ai2.seq 
TCATAGAATTTG cjbIU_al2.seq 
TCATAGAATTTG h36b_ai2.seq 

GAGTGTTACTTT Majority 



7730 



TTACTTATAAAACTGTCATCTAGTTTCACATCATATGT 
TTACTTATAAAACTGTCATCTAGTTTCACATCATATGT 
TTACTTATAAAACTGTCATCTAGTTTCACATCATATGT 
T T A C TT ATAAAA-CTGTCATCTAGTTTCACATC ATATGT 
TTACTBATAAAACTGTCATCTAGTTTCACATCATATGT 



7740 



7750 



GAGTGTTACT TT 2603_ai2.seq 

GAGTGTTACTTT 18rs21_al2.seq 

GAGTGTTACTTT 515_ai2.seq 

GAGTGTTACTTT cJbtU_al2.seq 

GAGTGTTACTTT h36b„ai2.seq 



TTCACC T T C 



7702 

7702 TTGACCTTCTCCTAAATT 
7610 



TCCTAAATTCAAA CC TCTAACATACACTTTATTTCCCATGT Majority 

7780 7790 7800. 

ACATAGAGTTTATTTCCGATGT 2603_ai2.seq 
ACATAGAGTTTATTTrrnA-rfiT io«.oi 



7760 



7770 



115 ACCTTCTCCTAA ATTC A A ACCTCTA_ _ 4 , , n t A , ^ ^ A t s ^ , t , 

AAACCTCTAACATAGAGTTTATTTCCGATGT I8rs21_ai2.seq 

(l A C. T T T A T T T r C r a t- r- -r c ■ «- _ ,~ 



I I G A C C i T C i C CTAAATTCAAA.CCTCTAACATACAGTTTATTTCCGAT 515 all s^q 

- --C CBT CMC C T A aWt T C A A A C C T C T A A c|Jt A G A G T TT A T T TflDG A T G T o bU al?sea 
C CHT cHc C T A aHt T C A A A C C T C T A A cHf A G A C T T T A T T tIHg A T G ^ h3^ ail ^ 



T T G 
TT G| 



ATTCTAATTTAACCCCCTTAACTATTCCACCATC 



7752 
7752 
7660 
7649 
7702 



ATTC 
A T fx 
ATTC 
ATTC 
ATTC 



T A A T 

T A A T 

T A A T 

T A A T 

T A A T 



7810 



ATTATTAGGCCCACCA Majority 



7830 



7840 



7850 



CTTCCAAT 



T^*n^^ TTAAGTATTCCACCATCATTATT -iGGCCCACCA 2603 ai2.seq 
^M^^^ TAAGTATTCCACCAtCATTATT B^CCCACCA I8rs21 alCeq 
TTAACCCCXTT A ACT A TTCC ACC ATC ATT ATT AGGCCC ACC A 5 15^ 12. seq 
ii AA ^ CC H TTAAGTAT ^ CCACCATCATTATTAGGC CCACCA ojblll alTseq 
TTAACCCCHtTAAGTAtDcCACCATCATTATTAGGCC CDC C A h36b al2 seq 



T" ------- . M ^ u n v, n m. x, tx i 1/14 i «. u v» ^ u igj u u A n36b_aiZ 

ACCATCCTTCATTACACTTC CATCATTTCCCTGTAAAGTATA Majority 
7860 7870 7880 7890 7900 




7902 
7902 
7810 
7799 

-7852 



7952 
7952 
7860 
7849 
7902 



C A C 

C A C 

C A C 

C A C 

C A C 



C C A T A G G A T C T T C : G A T A,G T T C I C A T T A A C A A T T G A G f T T T C T T T T G T T 2603 ai2 sea - 
C C AT AftC ATCTT C G ATAGTTCCATTA AC AATTG AGTTTTCTTTTGTT I8rs2l ai2sea 
C C A T A G G A T C T T C G- A T A C T T C C A T T A A C A A T T G A C T T T T C T T T T G T T 515 ail sea 
CCATAG.G ATCTT CH A TAGTTCCATTAACAATTGAGTTTTCTTTTGTT clblli alTsea 
GGA ^ ACG ATCTT A TAGTTCCATTAACAATTGAGTTTT. CQ TTTGTT ^6b^^ 



A- A AATCGTTTCAAATT.GTTi 



'GCTGA ATTTTAGATAAAATTTC ATTGTTAGA Majority 
• • . BQ }° 8020 8030 8040 8050 

A M A IS GTT ^ CAAATTG ^ TGCTGAATTTTAGATAAAATT TCAfTGTTAGA 2603 al2.seq 
AAAATCGTTTCAAATTGTTGCTGAATTTTAGATAAAATTTCATTCTTAGA ISrsil al2se 



A « TTTCAAATTGTTGCTGAATTTTAGATAA A- A TTTC ATTGTTAGA 515 ai2 seq 

TTTCAA A TTGTTGCTG A ATTTTAGATAAAATTTC ATTGTTAGA cjbTll al2seq 
TTTCAAATTGTTGCTGAATTTT AG A.T A A A AT T T C A T T||T T A G A h36b_al2 seq 



A A A A T C 
A A A A T C 



TCTAT CCGCTGAACTTACCATAGGCGTGTAGTACTCAGCTTTGCAAGAGA Majority 
5222 fl0 . 70 " 8080 8090 • . 8U)6 

2? IH A !^ CGCTGAAGTTACGATAGGGGTGTAGTACTCAGG TTTGCAAGAGi 2603 ai2.seq 

ITn J G I A I GGGGTGAACTTACGATAGGGG TGTAGTACTCACGTTTGGAAGAGA ISrsifa l^eq 

l*£ I nJmi'l Z r G A A G TT A C « A T ^ G GGGTGTAG TACTCAGGTTTGGAAGAGA51S aliJeq ^ 

SS ^A^C^G 0 ^^^ 



T A C T C A G G T T T| 



cjblll_ai2.seq 
G A A G A G A h36b_al2.seq 
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Weighted residue weight table. 
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AC G A C 



<p pg__r_A |E pnt^ g-A';:* TOT 3 em t c ~ T^ aaactt taaaagcttcc 

81 J 0 8120 8130 



Majority 



8052 
8052 
7960 
7949 
8002 



8102 
8102 
8010 
7999 
8052 



8140 



A C G A C 
A C G A C 
A G G A C 



G A C 
G A 



^^^n^^^ ATTTCTCCATCTGAAAGTT TAAAACCTTC 
^^^^^3^ CATT1CTCCATCTGAAAGTTTA AAAGCTTC 
TCATTAGTTCTGTGATTT C T CCATCTGAAAGTTTAAAACCTTC 



8150 



TCATTAGTTCTGTl 
TCATTAGTTCTGTl 



A T T T C 
A T T T C 



ccatctgaaagtt) 

CCATCTGAAAGTT] 



AAA.ACCTTC 
AAAAGCTTC 



2603_al2.seq 

18rs21_ai2.seq 

515_al2.seq 

cjblll_ai2.seq 

h36b_ai2.seq 



TCTTTC A 



ATTTTTC 
8160 



TC-TTTCA ATTTTTG A 
TCTTTCAATTTTTGA 
TCTTTCAATTTTTGA 



AAAGTACCATCTTGATTTTTCTT 
8170 8180 



ATACTCCTCATT Majority 



8190 



8200 



CCATCT.TGATTTTTCTTA TACTCCTC ATT 2603 a!2.seq 

AAAGTACCATCTTGATTTTTCTTATAC " ~ 

A A AG T A C 



-TCCTCATT 18rs21_al2.seq 
T C T T T r a t t t t r , , * . CATCTTGATTTTTCTTA TACTCCTCATT 515 al2.seq 

T CH TTCAATTTTTr A AiirT A r CATCTT A E TCG ^CATT cjbllt alTseq 

T CUT TCAATTTTTGAAAAGTACCATCTTGATTTTTCTTAT aBt C C T C A T T h36b al 2 seq 



A T A A A 



8152 
8152 
8060 
8049 
8102 



8202 
8202 
8110 



CTTGTCTAAAACCAGATATA T C G ATACCAAAATTAAAAATGTCAT 

8210 8220 ' " 

x. 



Majority 



ATAAACTTGTCTAAAACC 



8230 



8240 



8250 



« 



ATAAACTTGTPT'i i a Ar AGA I A I ATCGATACCA AAATTAAAAATGTCAT 2603_.al2.seq 
A?AA^C^r^r?AA A Arr A r A i^ 18ts2l_ai2.seq 
ATAAACTTGTCTAAAACCAGATATATC_G ATACCAAAATTAAAAATGTCAT 



ATAAACTTCTCTAAAACCAGATA TAT c[l ATA CCAAAATTAAAi 
ATAAACTTGTCTAAAACCAGATATAT cQ A TACCAAAATTAAa| 



515_al2.seq 
ATGTCAT cjbll l_al2.seq 
A T G T C A T h36b_ai2.seq 



AATTTTTCTCTTTTAAACTATT T ATATAAAGTTTGCTTG GTGTTCC ATGT 



8260 



8270 



Majority 



8280 



8290 



AATTTTTCTGTTTTAA 
AATTTTTCTGTTTTAA 
A A T T T T T CTGTTTTAA 

AA'TTJTTCTGTTTTAAACTATTTATATAAAG 



A ^ A 33J ATATAAAGTTTGGTTG ^ TGTTCCA TGT 2603 ai2_seq 

r^II TATATAAACTTTCGTTGGKnCCA TGT 18rs21 al2.seq 
ACTATTTATATAAAGTTTGGTTGCTGTTCCATGT515 aii.seo 



TTTGGTTGG 



515_ai2.seq 
TGTTCCATGT cjbll l_al2.seq 



8152 AATTTTTCTGTTTTAAACTATTTATATAAAGTTTCGTTGGTGTTCCA'KiS^ 
• . yCTTTCACTGGTCCATT TCGATAAATTGTACCTTTAGGGTAATTAAGATT 



8310 



8320 



Majority 



8330 



8340 



8252 
8252 
8160 
8149 
8202 



8302 
8302 
8210 
8199 
8252 



8350 



TCTTTCACTGGT CCATTTCGATAAATTGTACCTTTACG GTAATTAAGATT 
TCTTTCACTCCTC.CATTTCGATAAATTGTACCTTTA G T TT 
ICTITCACTCCTCCATTICCATAA4ITCTACCTTTACC • 

actggtccatttcgataaaBtgtacctttagggtaattaagatt 

ACTGGTCCATTT cQ ATAAATTCTACCTTTAGGGTAATTAAGATT 



TCT 
TCT 



2603_ai2.seq 

18rs21__al2.seq 

515._ai2.seq. 

cjblll_ai2.seq 

h36b_.ai2.seq 



TAAATCTAAATAATCAAGTTTTTGTA 
8360 8370 



ACTTT CCAGACATTATCTCTGTTT Majority 



8390 



TAAATCTAAATAATCAAGTTTTTGTA 
TAAATCTAAATAATGAAGTTTTTGTA 
TAAATCTAAATAATCAAGTTTTTGTA 



8400 



TAAATCTAAATAATGAAGTTTTTGTA 



GTTTCCAGAGATTATCTGTGTTT 2603„ai2.seq 

GTTTCCAGAGATTATCTGTGTTT 18rs21_al2.seq 

GTTTCCAGAGATTATCTGTGTTT 515_.ai2.seq 

.TAAATCTAAATAATGAACTTTTTCTAAGTTTCCH A§_ ATTATC'TGTCTTT hSG^alElseq 6 ^ 

GATAA CTATCTAAGGGAAACAAAAAGT A a c 
i ; i 

8410 



8352 
8352 
8260 
8249 
8302 



8402 
8402 
8310 

8299 
•8352 



8452 
8452 
8360 
8349 
8402 



TCTCCCCATTTCCTTTTATA Majority 

. _________ , 8 y0 8440 8450 

GATAACTATCTAAGGGAAACAAAAAGTAACTC 



8420 



GATAACTATCTAArrrA a a r* a a a « a /. V V « ~ -TCCCCATTTCCTTTTATA 2603_al2.seq 
GATAA CTATCT'AAGGGAA «H ^AAA4rT AA ^ TG ^ C ° C C A * * T C C T T T T A T A 18rs : 2l_al2^eq 
G A T A A C T A T c I AAGGGAAA n A AAAAGTAACTCTCCCCATTTCCTTTTATA SisLal2.seq 
^ A I AA S ACGGAAACAAAAAGTAACTCT CCCCATTTCCTTTTATA ciblll ai2 sen 

GATAACTATCTAAG. GGAAAC A A AAAGTAACTCTCCCC AT T___ ; C GTT IflA T_jj h^Cb^ai-i^s^ 



TCCTCGGGCTTAT C AGTAAG T AG . A A A A T T A C T T T T A T T T A G A T A T C C A T T Majority 

84 . 6q f_™ 8480 8490 8500 

?CC?CGGCC?^?r A «I AAG TAGAAAATT;CTT.TTATTTAGATATCCATT 2603_.al2.seq 
I £™I ^55511 I^ A SI AA 5I A 5 AAAAT V A CTTTTATTTAGATATCCATT 515_al2.seq 



515_al2.seq 

A « A A AA1 1A £ TTTT i T I TAGATA TCCATT <Ubill_al2.seq 
ABlAllA A TT aHtTTT|8t|It AB AT A_BC cBt't h36b_ai2.seq 

TTTTTTCATTTGTTCAAATTGGCTTTCATATGATCCA 



TC.CT CJl G GJ^TJ? A T C 
~| TC , 



mm A G T JL A _£_JL AGAAAATTACTTTTATTTAGA- 

g cBtBat c a g tIIaI^* — ■ — - ^ — " 



8510 

ttttttc atttcttc 
ttttttcatttgttc 
ttttttcatttgttca 
ttttttcat t t g t t 

T TO T T T_B ATT 



8520 



8530 



C C'C AGTTTA AA AT Majority 



8540 



8550 



AAATTGGCTTTCATATCATGCACCCAGTTTAAAAT 2603_ai2.seq 
M^IJ^CTTTCATATGATGCACCCAGTTTAAAAT 18rs21_aiCeq 
AA " A ^^ GGG j^"^^ G ATATGATGCACCCAG'TTTAAAAT 5l5_al2.seq 
15 A A A I T G *£ZZ£ C A T A T G A T G C A C CCAG T T T A A A A T cjblll_al?seq 
T C A A A T T G|H T tB CO T A T Gfl T cfl A C IOT tBH A A A H h3Gb_.ai2.seq 
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"75/4*7 



" TT^Tl TOfejaiitfel^fg gH ^ cFAT c ACTTACATGAACAAT A^aj^ 

8570 — ' 6 
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8560 



8580 



8590 



8600 



^02 TATTAATAGCATATGATCTCGTACCAACACC 2603_a| 2 .seq 



8552 
8552 
8460 
8449 
8502 



ATTTTTTGACTATTTCGATTTACTTCACTCAAA A TATCATCTGCCTCCAT Majority 
86 . 10 8620 8630 8640 ~~^5 0 



8702 
8702 
8610 
8599 



8752 
8752 
8660 
8649 
8702 



8802 
8802 
8710 
8699 
8752 



8852 
8852 
8760 
8749 
8799 



8902 
8902 
• 8810 
8799 
8849 



ATTTT ^ ^ ^ ^ A 1 A 1 ^Ai^iocciCCAT 515_ai2.seq 

A TOT T Tmlc tH? Tile gWtB^ mm t S T A A A « A-TCTCCCTCCAT cJbUl.al^seq 



8602 
8602 
8510 
8499 
8552 



8652 
8652 
8560 
8549 
8602 



CAAGGCTTTCATAGTAA ATCTTT C TC CTACTTTACTAAGATACTACTCCT Majority 

86 70 8680 ftfion 870O 



TTTCTTCCTCTCCACTTAATCCATTr. G TACTACATCCCCACTTAC.r.TTTA Majority 
S*° 8720 8730 8740 8750 



T T T G T UCTCr»»nA^rr{?^;i!" UA KCCC;Cn »" ^ 2603_a 12 .se q 
TTTrTTffTrTrrir^I^ 5CIACTAC4TCCCCJCTIAGCTITA ISr^I a 12 .seq 

^?c^GC^c?GG^G??M?rr^^^ ACTACATCCCCACTTA ol5_alS.seq^ 
A _L 1 Jl 1 1 fa c ItTGCACTTAATCC AT TGGTAGTAGATCCCCACTTArrTTTA ^m. „ IO 

tQtQt tBDt CTGGAGTTAATC cfl_(SXS T |]G tDDDDc CCCACTTAGCT tD__ M6buai 2 !seq e< ' 



CGAGCTTCTCTCGGAATCCTTTTTATAATCTCTT C ACCATTATTTCTTAA Majority 
87 . 60 • 8770 8780 8790 8800 



GGACCTTCTGTCGGAATCCTTTTTATAATCTCTTCAGC A T TATTTCTTAa ' mm „ 

TTCTTTATCACTATAA TTCT CTC TC TGAATTCTCAACTTAGTTTGAACr. r. Majority 
88 2 Q 8830 8840 88*50 

;«???"sts;i?it?;??s;r cTcA * T ""*""" 

B T0T IT A TIM C T A T A A TIM I^I^CTGAATTGTGAACTTAGTTTGAAGGCcJbll I_a iTseq 
U TUT T T A TMflC TAT A A tOQtII T G T C T G A aQ T G T C A a|Bt AGTTtQaAG||c h36b_aT 2 .seq 

CATAATATTTATCATCT TC T T T A A A T C C T T T T AC G A C A TCTA f!A CTCCTA Majority 
8860 8870 8880 8890 " 8900 



CATAA^^o^lT^JrH^^^ 0 "" 40 " 18re 2 l_al2?L, 
i-AIAAlATTTATCATCTTCTTTAAATCCTTTTACGAC-ATCTACACTCrTA ,1? ™ 

CAT A AT aHHH aIHRI a t SA? C T T T A A A iS C C T T T T A CC A C ^ C T A C AC T C C * A cJolt^ttUq 
C A T A A T aHHH "EE A T efl T C I T T A A aH/ C C T T T T a|_Qa ell tBB A CDH t C C^^jteZg* 

C C A T C A A A A A T A T C T C A A C C A T A G G T A A C T A A T C C A A C C C T A t T A T C A C T Ma l.^,^ 
89 . 10 «j» 8930 8940 8950 

CCATCAA A HATATCTC AA ?r A TA G G TA A C T A A T G C A A C C;C T A T T A T C A C T 2603_al2.seq 
rriTri A i!!I A ^I^ ACCATASSIAAC ' rAtTGCA * CC "iTTATCACT 18rs21 alHeq 

CCATCft AA A A T A T^v^ AA ^^ A ^ AG ^^ AA ^^ AA ^^*' AA *'^'^ *^' A TTATCACT 515 ai^seq 
CCATCAAAAATATCTGAACCATAGGTAACTAATGCAACCCTATTATCACT r-lh7?i »|7«~, 
CCATCAAA'AATATCTGAACCA TlJ G T A A C T A aQ G cQQ c|B CQQt T_J T C aQQI ^"all!^ 

GTTTGCTCCTAAAATA TC TTT TACT CCCCTCCCAACACCTTCGGCACCTT Majority 
8980 SSTO 8980 8990 9^00 " 



r TCCCAACAC " TCCCCAC " 2603_ai2.se, 

GlTlGcl6cllA AA ZAZrZZZZ A nZ C Z CGTCCCAkGKGCTTCGGCKGCT1: 18rs21_al2?L, 

HtttGCTCTTaa-a aW a Z r ZZ Z T A AA&SA G T A£ C . A A C A C C TTCGCCAGCTT cJbllLalTseq 

ua AAA A U t. T T T T A —HI G TUU CO ADD G C T T C_H c|| G cH T h36b_ 
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9052 
9052 



8949 
8987 



9102 
. 9102 
9010 
8999 
9037 



9152 
9152 
9060 
9049 
9087 



9202 
.9202 
91 lO 
9099 
9137 



9252 
9252 
.9160 
9149 
9187 



9302 
9302 
9210 
9199 
9237 



9352 
9352 
9260 
9249 
9287 



T C T T G 



9010 



9020 



TGGCCCATCCTTATTCATTGAC Majority 
903O 9040 9050 



8952 
8952 



8849 



9002 
9002 
8910 
8899 
8937 



TCTTGGCTTTAT T A T G C C TT TG A A A A TT T C G G C C A TC GTT A T T C A TT G A 2603 a!2 sea 
TCTTGGCTTTATTATGCCTTTGAAAATTTGGGCCATCGTTATTCATTGAC iSt tlTtea 
TGGCTTTATTATGCCTTTGAAAATTTGCCCCATCGTTATTCATTCAG I8rs21_al2.seq 
M n G °m TT ra TT A T L G <L C . TT TG A A A A TTTG GG C c A Tn r.TT A T TC a tt r. a r. 



515_al2.seq 
_ - - - cJblU_ai2.seq 

T TH T T C A TfjG AH h36b_ai2.seq 



TTAGAATTATCGAGTACGA 



9060 



AGACAACATCTAACGGCTTTTCTTTGTCCAC Majority 
9070 9080 9090 9100 



— « — — i i __, 

TTAGAATTATCGAGTACGAAGACAACATCTAACGGCTTTTGTTTGTC 
TTAGAATTATCGAGTACGAAGACAACATCTAACGGCTTTTGTTTGTC 
TTACAATTATCCAGTACCAAGACAACATCTAACGG.CTTTTGTTTCTC 
T T Jm GAATTAT C _£ A G _i ACGAAGACAACATCT A AC GG CTTTTGTTTGTC 

t tH g a a t t a t cH a g|J a c_3 a aB a c a a c a t c|I aBB g cD tBO tH t t tBt| 



T G 



C T tttactatc gtttttccactcacacttaactcaattttatatt 

9120 9130 9140 



C A C 2603_al2.seq 
C A C 18rs2I_ai2.seq 
C A C 515_al2.seq 
CAC c|blH_aI2.seq 
AH h36b__al2.seq 



TAT Majority 



TGGTTTTACTATGGTTTTTCCACTGACAGTTAACTCAATTTTATATT 
TGGTTTTACTATGGTTTTTGCACTGACAGTTAACTCAATTTTATATT 
TGGTTTTACTATGGTTTTTCCACTGACAGTTAACTCAATTTTATATT 
TGGTTTTACTATCGTTTTTCCAC T G A CAGTTAACTCA ATTTTATATT 
B GOT T T T A||T ATCGBtT Tflc C|| C tQa CAGTTAACTCA ATTTTATATT 



9150 



TAT 2603_al2.seq 
TAT 18rs21_al2.seq 
TAT S15_al2.seq 
TAT cjblll_al2.seq 
TAT h36b_a!2.seq 



TATCAGCTAAATCACCTACTTCTCAAATACGTT.TACATA 



9160 



ATCTTCCCTCT Majority 



9190 



9200 



TATGAGCTAAATC ACCTACTTCTGAAA 
TA'TGAGCTAAATCACCTACTTCTGAAA 
TATGAGCTAAATCACCTACTTCTGAAA 
T A T G AG CT AAATCACCTACTTCTGAAA 
T A T G AO cH A A A T C A II TACTTCTGAAA 

GGAATTTCTCTTATATGCTCACCTTCA 



9210 



9220 



TACGTTTAGATAATGTTCCCTCT 2603_ai2.seq 
TACCTTTAGATAATGTTCCCTCT 18rs21_al2.seq 
TACGTTTAGATAATGTTCCCTCT 515_al2jseq 
TACGTTTAGATAATGTTCCCTCT cjbl U_aI2.seq 
TACGTTTAGATAATGTTCCCTCT h36b_ai2.seq 

CTTGAATATGGGTTAACTGCTTT Majority 



GGAATTTCTCTTATATGCTCACCTTCA 
GGAATTTCTCTTATATGCTCACCTTCA 
GGAATTTCTCTTATATGCTCACCTTCA 
GGAATTTCTCTTATATGCTCACCTTCA 
BGAATTTCTCTTA t|_| TGCTCACCTTCA 

TGCCTCTCACTTTCCATTTCCAACTCA 



9230 



9240 



9250 
-jt_ 



9260 



9270 



CTTGAATATGGGTTAACTGCTTT 2603_jal2.seq 

CTTGAATATGGGTTAACTGCTTT 18rs21_.ai2.seq 

CTTGAATATGGGTTAACTGCTTT 515__al2.seq 

CTTGAATATGGGTTAACTGCTTT cJbUl_.al2.seq 

CTTGAATATGGHtTAACTGCTTT h36b__ai2.seq 

ACCTTTAACATGCTCAAGTTTAT Majority 



9280 



9290 



9300 



TGCCTCTGACTTTCCATTTGGAACTGA 
TGCCTCTGACTTTCCATTTGGAACTGA 
TGCCTCTGACTTTCCATTTGGAACTGA 
T G C C T C TG A C TT TCCATTTGGAACTGA 

t g c c t c tHa||t tHc CATTTGGA aB T G A 



ACCTTTAACATGCTCAAGTTTAT 2603_al2.seq 
ACCTTTAACATGCTCAAGTTTAT 18rs21_ai2.seq 
ACCTTTAACATGCTCAAGTTTAT S15__ai2.seq 
A CC T TT A A CATGCTCAAG. TTTAT cjbll l_al2.seq 
ADUT TTAACATGCTCAA GB T T A T h36b_al2.seq 



A AGATTCCTTTGTATCTTCATAAATTCCTGTCGGGGGATACTGCTTATCT Majority 
,_ 93 , 10 - 9320 9330 9340 ^9350 



A AGATTCCTTTGTATCTTCATAAATTCCTG.TGGGGGGATACTGCTTATCT 2603 a!2.seq 
* * ° 4 I It S5. T T G T A T C TT C A t A A A T f C C T G T G GG G G G.A T ACT GCTTATCT l*r S 21_ai2^6q 
AAGATTCCTTTGTATCTTCATAAATTCCTCTGGGGGGATACTGCTTATCTS15 a!2,seq 
M^II'n TTTGTATCTTCATA Ailt C C T G TC'C.G C G GG AT AC TGCTTATCT cjbiuji^seq 
A AG A T T CD* TTGTATC.T T C A T A AflB^ C C T G tBBg GGG'ATAGTGCTTATCT. h36b all.seq 



AGTTCTTCCTGATTTT. GTCC 



9360 



. AATTG TGGAATTTTTATCACCACTATTTTG Majority. 
9370 9380 9390 9400 



A G T TC TTCCTGATTT T GT C C A ATTG TGG A AT TTTT,AT C A C C A CT A TT TT G 2603 al2.seq 

A ^I^^^ TGATTTTGTCCAATTGTGGAATTTTTATCA ^ **rs2! alHeq 

AGTTCTTCCTGATTTTGTCCAA T,T GTGGAATTTTTATCACCAC TA T T T T G SISaiis 



A G T T C TT CCT G AT T TTGWAAT TGTG G A A TTTTfATCACCACTATTTTG cjblll ai2s<^ 
A-GTTcBtcBtgBBtTTGtIIc AATTHTllG A0TTTTT ATCaBcACTATTTTG h36b_a7 2 ^ 

TAT CGTAGTTTT.TCCATTACTCTCAACCTTAACTTGCCAAGTCTGCT. TAG Majority 
9410 9420 9430 9440 9450 



I A I^ TUTTTTTCCATTACTCTCAACCTT AACTTGCCAAGTCTGGTTAG 2603 a!2.seq 
TATCGTAGTTTTTCCATTACTCTCAACCTTAACTTGCCAAGTCTGGTTAG 18rs21 aiHeq 
TATCGTAGTTTTTCCATTACTCTCAACCTTAACTTGCCAAGTCTGCTTAG5I5 ali.seq 

I A Io^ A !: TTTTTCCATTACTCTCA ^ cJbTlUai^seq 
TATCGTAGT T-T TTCCATT ACTCTCA AC CTTAACTTGCCAAGTCTG gQt A G h36b_a!2 -seq 
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9349 
9387 



9502 
9502 
9410 
9399 
9437 



9552 
9552 
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T C T T T ff ] k"T | AC ggg jg gg ggj M TCTGATAAACTATAATCTCCA MafoTF^ 

. ^60 9470 9480 9490 9500 

!£! TCTTTTTATAACCTTCGGGCCCTGTTTCTTCTGATAAACTATAATCTCci 2603 ai2.seq 

oolS TCTTTTTATA.ACCTTCGGGCGCTGTTTCTTCTGATAAAGTATAATCTCCA 515 ai2 seq 
SSS I « III ^ ^BB C C T T C G G C C G C T GTTTCTTCTGATAAACTATAATCTCCA cjbll 1 ai^seq 

9337 T C T TT T tBt a|.C C T T C G G O G c|1 G T T T C T T C T G A T A A AG t|Jt A A T C T C C A h36b_al2.se^ 

GGTATGAGATTATCA AAA CTAGCTTCAC CTCTTAGCTCAGCAGTTACTTT Majority 
95 t 1Q 9520 9530 9540 So 

9452 CGTATGAGATTATCAAAAGTACCTTCACCTGTTAGCTCAGCAGTTACTTT 2603 al2 seq 

**** GGTATCAGATTATCAAAAGTAGCTTCACCTGTTAGCTCAGCAGTTACTTT I8rs2i aizTseq 

9360 GGTATGAGATTATCAAAAGTAGCTTCACCTGTTAGCTCAGCAGTTACTTT 515 a!2 seq 

««q G G TAT GAGATTATCAAAACTAGCTTCACCTGTTAGCTCAGCAGTTACTTT clblll ai2seq 

G G TDlT CAGATTATCAAAAGTAGCTTCACCTGTT ABIC T C A-GB A GTTACTTT b36b_ai2.seq 

TTCTATTTTACTTTCTGGA TCACCAGTAGTTTTTAAAACAAAGCTAGCTT Majority 
9570 958Q 9590 9000 

TTCTATTTTACTTTCTGGATGAGCAGTAGTTTTTAAAACAAA-GGTAGCTT 2603 ai2.seq 

TTCTATTTTACTTTCTGGATGACCAGTACTTTTTAAAACAAAGGTAGCTT 18rs21 aI2 seq 

TTCTATTTTACTTTCTGGATGAGCAGTAGTTTTTAAAACAAAGGTAGCTT 515 ai2 seq 

TTCTATTTT A C TT TC TGGAT C AG C A G T A GTTT TT A AAACAAAGGTAGCTT cjblU ai^seq 

TTCTABtTtBcTTTCT gBBt g||gHa G T A CBTTTTA AAACAAAGGTAGCTT h36b_al2.seq 

TTCAAAGTCCT TTGTTCTGGTCATCTGTCTTTTTAACAACTAACTTTCCT Majority 
gg*0 ,9620 9630 9640 9650 

TTGAAACTGGTTTGTTCTGGTCATCTCTCTTTTTAACAACTAACTTTCCT 2603 ai2 seq 

TTGAAAGTGGTTTGTTCTGGTCATCTGTCTTTTTAACAACTAACTTTCCT I8rs21 ai2.seq 

9460 TTGAAAGTGGTTTGTTCTGGTCATCTGTCTTTTTAACAACTAACTTTCCT 515__al2 s6q 

9449 TTGAAAGTGGTTTGTTCTGGTCATCTGTCTTTTTAACAACTAACTTTCCT cjblU al2?seq 

.9487 TTGAAAGTGGTTTGTTCTCCTCATCTGTCTTTTTAACAACTAACTTTCCT h36b_ai2.seq 

TTAGCACCATTTTCCGGTACGGTACTTTCCCC T A A A AC A TT C GT ATT A AG Majority 
^ 96 t 60 Q670- 9680 , 9690 ~~ ^0 

TTAGCACCATTTTCCG.GTACGGTACTTTCCCCTAAAACATTGGTATTAAG 2603_lai2 seq 
TTAGCACCATTTTCCGGTACGGTACT T-T CCCCTAAAACATTGGTAT T A A G 18rs2i^i2^eq 
TTAGCACCATTTTCCGGTACGGTACTTTCGCCTAAAACATTGGTATTAAG 515_ai2.seq 
TTAGCACCATTTTCCGGTACGGTACTTTCCCCTAAAACATTGGTATTAAG cjblU ai2 seq 
TTAGCA.CCATTTTCCGGTACGGTACTTTCCCCTAAAACATTGGTATTAAG h36b_al2.seq 

CGGTATTTGCCACAAACAAAAAAGACTTAACGTCAATATTTTACAAAATT Majority 
9710 9720 9730 9740 9750 

CGGTATTTGCGACAAACAAAAAAGACTTAACGTCAATATTTTAGAAAATT 2603 al2.seq- 

9652 CGGTATTTGCGACAAACAAAAA AG ACTTAACGTCAATATTTTAGA AAATT 18rs21 ai2^eq 

9560 CCGT.ATTTGCGACAAACAAAAAAGACTTAACGTCAATATTTTAGAAAATT 5I5_ai2 seq 

9549 CGGTATTTGC G A C AA ACAAAAAAGACTTAACGTCAATATTTTAGAAAATT cjbUI al2.seq 

9587 CGGTATTTGCGACAAACAAAAAAGACTTAACGTCAATATTTTACAAAATT h36b_ai2.seq 

TTTGGTATTTTCTCATTTTACAAC.TCCTATTGTGCCGAAATGTCGTTTCT Majority 
9760 9770 9780 , 9790 9800 

9702 T T T G G T A T T T T C T C A T T T T A C A A C T C C T A T T G T G C C G A A A f G T C G X T T C T 2603 ai2 seq 

9702 T T T G G T AT T T T C T C A T T T T A C A A C T C C T A T T G T G.C C G A A A T G T C G T T T C T lSc^l^i^seq 

T T-T G G T A T T T T C T C A T T T T A C A A C T C C T A T T G T G C C G A A A T G T C G T T T C T 515 a!2.seq 

T T T.G CT ATT TT CTCA TT T T A C A AC TCCTATTGTG-CCGAAATGTCGT.TTCT 6lb|ll^ti.seq 

T TtiGGTA T TT T C T C A.T JT T A C A ACT G G T A-T TGTGC C G : A A A T GT.C GTTT C T h36b.jal2.seq 

A A A T CTA A G A T C AG A TACA.C A A T A TC C T A C A A T AT A C A A AC TA TC AC TT A Majority 

J 9810 9820 9830 9840 - 9850 

9752 AAATCTAAGATCAGATACAGAATATCCTAGAATATACAAACTATCACTTA 2603„al2.seq 

9752 AAATCTAAGATCAGATAC.AGAATATCCTAGAATATACAAACTATCACTTA 18rs21 aliTseq 

9660 AAATCTAAGAT.CAGATACA.GAATATC. CTAGAATAT ABA A ACTATC ACTT A 515^ai2.seq 

9649 AAATCTAAGATCAGATACAG A.A TATCCTAGAATATACAA AC.TATCACTTA cjblll^i2seq 

9687 AAATCTAAGATCAGATACAG A ATA TCCTAGAATATACAAACTATCACTTA h36b_ai2.seq 

TTATGAT.ATCAA TAATTTCTTATTATAA G GTATG GAATTTT AATGTTTTT Majority 

9860 9870 9880 9890 9900 

9802 TTATGATATCA A.T AATTTCTTATTATAAGGTATGGAATTTTAATGTTTTT 2603 ai2.seq 

9802 TTATGATATCAATAATTTCTTATTATAAGGTATGGAATTTTAATGTTTTT I8rs21 al2/seq . 

9710 T T A T C A T A T C A A T A A T T T C T T A T T A T A A C G T A T G G A A T T T T A Aflc T T T T T 515 al2.seq 

9699 TTATGATATCAATAATTTCTTATTATAAG.GTATGGAATTTTAATGTTTTT cjbUI ai^seq 

9737 T T A T C A T A T C A A T A A T T T C T T A T T A T A A G G T A T C G A A T T T T A AflC T T T T T h36b a72.seq 



9602 
9602 
9510 
9499 
9537 



9652 



9702 
9610 
9599 
9637 
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t c c c a m r;r W y^ m] m ^r,t »"TO S^TTTt g u AAY cttat t TTTT 

". 10 S920 9930 9940 



A T Majority 



9950 



mil TCCCAATTTTTr A at AA ^ET. ^^^^^^^^ A ^^^^ ATAATCTTATTTTTT A^T 2603 a!2 seq 

9zS ?crCAA^^^MJ«^ TTTTTCTTTTTATTTGATAATCTTATTTT T T AT 18rs2~i aiHeo 
J760 TCCCA A TTTTTGAA tQ ATTTTTCTTTTT A TTTCAT A MM rR tVKI T tW 71X13 c,r |- aiz: - se ^ 

9749 TCCCAATTTTTCAATAATTTTTCTTTTTA "a ™ 7^ T T T^TF? f M , ^ 

9787 TCCCAATTTTTGAATljATTTTTCTTTTTATTTGATJurTnjrT^TuS 1 ^ 

TATCTTACAAATATTTCAATGAGCTTAACT A GTTCATTTTTCTTTTTTTA Majority 
22 "7° 9980 9990 10000 



9902 
9902 
9808 
9799 
9837 



TATCTTAG AAATATTT C-A A T G AGCTTAAGTAGTTGATTTTTC TTTTTTTA 26m a i? ^ 
TAT C T T A G A A ATATTTCA ATGACCTTU C T AC TTC ATTTTf C TTTTTT n 22? flT2 

tatcBtBgaa aHB tB tII a a tD ' BB tBaBctactt Gmm t tOT IWTOTO m si ; f ^ 

TAT.CTTAGAAATATTTCAATG A™ TTAAGTAGT oTTTT T ™ T "f f f A e itfl ^ 

tat.cttagaaataBttcaatBagcttaagtact T t SShS^T 



TGTTTTAAAATA TTCCTTAAAAATAATGTTTCACAC 

10020 10030 



looio 



9952 
9952 
9856 
9849 
9887 



TGTTTTAAAATATTGCTTAAAAATAATCTTTGAGAGAG- 
TG T T T T AAA AT AT TG C TT A A A A A T A A TGTTTGAGAGAG - 

th t t td adddq ah tb cua A ab nmm tQid thb c a n A n - 

TCTTTT A A AATATICCTTA AAA™" "™CUAC 

tgttttaaaatattgcttaaaaataatgtttgacagag 



- -TTTACTGAAT Majority 
10040 10050 



TTTACTGAAT 2603_ai2.seq 

m T T A C T G A A T l8rs2Ual2.seq 

H T T A CBBD A T 515_ai2_seq 

- T TTACTGAAT cjblil_al2.seq 

TTTACTGAAT h36b„ai2.seq 

TCATTGAAAATTATTTACAAAAAGACATCCT T AATCAAATAAAACTTCTA Majority 

!£22 ™™ 10080 10090 loloO 

TGATTGAAAATTATTT 2603_ai2.se q 

? | aH tB t t HOD aB aB aBBI cDl tDDDB a a tB a aQ tB a a 'JMMk'SS^^ 

ACTTTATGCTATGATTACTACCCTT CCATTACTCTACACAAATCATCTCA Majority 
10 . 110 JJHf! 10130 WHO ,oiso 

9947 a'c tttatgctatgat ftHTFct I It?" C ? ™ " A A? cf ?f ?ff SSiXSU 

ACTTTATGCTATGATTACTACCCTTCCATTACTCTAGACAXatCATGTCA h36b_ai2.seq 



10000 

10000 

9903 

9897 

9937 



TCAACTTGGTT TATCTGAACTACTTATTAGCAA 

ioieo ioi7o ioiso 



10100 

10100 

10003 

9997 

10037 



ATATTGTCATGATTTA Majority 
10190 10200 



?C^c??GG?^A?c?rMrJ^ cjblll^seq 
1CAACTTGGTTTATCTGAACTACTTATTAGGAA-ATATTGTCATGATTTA h36b_ai2.seq 



ACAACTTTATTTAACA GTCAA C T CTCTCTGAATATCGAAAACAGTACAAT Majority 



10210 



.10220 



.10230 



10240 



10250 



b 49 ACAACTTTAT'T'TAAr*rTr'^ A ^ ^^^ 2603^12.^ 

1UU46 ACAACTTT-A ITT A A C A G T C A A G TC T.CT C T G A AT A T C ft A A A A r. iTti r At t ,^7,, 
W *• C A-A G T.TTATTTAAC AGTCAA-CTC'TCTC t'G AATATCGAA A' A GAGTACAAT' te6bu4l2ls^ e< ' 



A C T C T A C C A A T C T A A T C G TG T AACTAG 



10199 
10199 
10103 
10096 
10136 



10260 
-JL. 



10270 



AGAACA AGCTTTCAAATATATTT Majority 
10280 ' 10290 



A G T C-T AC C A A T'C T-AAT'GGT- GTAACTAG 
AGTCT ACC AAT CT AA T GGTG T AACTAG 

DDBDt aBcIBDB c tQBI tEUDH tB a cQa g| 

A G T C T.A CC AATCTA ATGGTG'TAAC.T AG 
AGTCTACCAATCTAATGGTGTAACTAG 



10300 



ATCATCAATCAC. ACGTTTTACAACTTT 
10310 10320 



10249 
10249 
10152 
10146 
10186 



ATCATCAATCACACGTTTTACAACTTT 
AT CAT CAATC A CACG T T T T A C A A C TT T 

BtQS3cEIaBc A iiMii TT t t aHEaSBBSBt 

ATCATCAATCACACGTTTTACAACTTT 
ATCATCAATCACACGTTTTACAACTTT 



AGAACAAGCTTTCAAATATATTT 2603_ai2.seq 

AGAA C A A GCT T T C A A ATATATTT 18rs2I„ai2.seq 

MBaDM A aIHHt TIMS1A TBH A T T T 515_ai2.seq 

AGAACAAGCTTTCAAATATATTT cjbl H_ai2.seq 

AGAACAAGCTTTCAAATATATTT h36b_ai2.seq 

TAAAA TTTTTGATCACGAATCAT Majority 
10330 * IO340 10350 



TAAAATTTTTGATCACGAATCAT 2603_ai2.seq 
T A A AATTTT T G A T C A C CA A T GAT 18rs21_ai2*.seq 
T A-A ABHBSDt T0AT cQcEESt G A0 515_aI2.seq 
TAAAATTTTTGATCACGAATGAT cjbl U_ai2.seq 
TAAAATTTTTGATCACGAATCAT h36b_ai2.seq 
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t c a c c fAgcw xmq^ m«CA,c aar-ypr |pp t caa a aatttccactatcttc Ma,^ — 

10 ? 60 'OfTO 10380 10390 J0400 

S ?rl^r!?rTl?^ TTIa " tACITTA " CU "*" T «»IUCTli 2603 a!2 sea 
mot B r iHBHHHH C ° T \I A ± < L11 A C T J& T ^£ T ^* A A A ATT T C GACTATCTTG 18rs¥l alHeo 

10196 T C A T r "Tr T I B C C T TB A C TB 1'OD TUB aQ 10] T&B D T AfJ C TflH 515 a 2 ^ 

0 23 1 Jiy aCn * CTITAGTCUAUTIICe " T *™»™ CJbTl al^sea 

10236 TCAGCCAGGTTACCTTTAACTTACTTTAGTGAAAAATTTGGACTATCTTG h36b_a7 2 .seq 

TGCAACTCCTTA TCGCA TACGAAAACATATTACTCCATTACTAGAAAAAC Majority 
t0 ?'° 10420 '0430 10440 10450 

!SS "GOACTGCTT!^^^ A ^ A ' AACATA "* CTCCAITACTA0AAAAAi 2603_al2.se* 
n^fi JMMML- LUm C m A !L; A C A J. A TTACTCCAT T ACT A G A A A A A C 18rs2I a!2 Jeq 

0246 J?ffffS^3 l ??? nC ™ A ™ A|,l,AC A H A T TEG TUB AH THEE ADBJIH AIM 515 af ^ 

10246 TGCAACTGCTTATCGCATACGAAAACATATTAGTCCATTATT iriii » »r ~7, TT 1 
10286 TGCAACTGCTTATCGCATACGAAAACATATTAGTC C0T TACTAGAAAAAC h36b^.aT2!seq e< ' 

TTCGATTTCACATTCTCAAAAA TACTATTACCCCTC-ACCAGTATCGAAT Majority 
' t0 ? 6Q '°f"> 10480 I0 490 • io500 

S TTCCATTTririTTfTriiii ATA ' IATIA '' CSTC AC 2603 al2.se., 

« m i& ™ £Mf ah ad t ddqd tdd c cbbt g^bbb&tb « i aB Sffifr 

TCGCTATTTAATCGCATTTTTA AATGCTCAATTTGGTATACAAGTTTATC Majority 
^ 10510 10530 10540 io550 

10448 TC GCTATTT,AATCCCATTTTTAAATGCTCAATTTGGTATAGA ACTTTATC 2fifn 

10448 TCGCT A TXT A A T£G C A TTTT TAAAT GC T C A A T T T G AAfi???J?c S iHea 

0341 ram t a tbb a aub catBt Timn gb Tin ahb tb g — i gb ah tddd ?b sisl i 

10345 TCGCT ATTTAATCGC ATTT TTAAATGC T™A TTT C gT^c" A G T T^T C c bit 
10385 TCGCTATTTAATCGC aQQ T TTAAATGCT cflj ATTTGGTATAGAAGTTTATC 

CCATGTCTAAGATGGATAAATTGCTCATC AAACGATTGTTATTAGAACAC Majority 
10560 >Q570 10580 10590 I06OO 

xs ^iryfjTTHyyj^ 2 6 03_ai 2 .se q 

OaS Mill tH ? m JiffJJ T £ A TCA A A C G A T T G T T A TTAGA AC AC I8rs21_ai2.sea 

J 0 **? C M«M T « T A iB AEMM AO A A aBtIMtB aBPH A aBg||t Tfl t|I a WMMSSSSMMM ^ al2 — 

10395 C CATCT CTAAGATCGATAAATTGCTCATCAA AC GAT TG T? A TT AG A AC AC c bm l£***n 
10435 C cQ T-H TCTAACATGGATAA A"T TGC T*B ATCAAACGATTGTTATTAGA Afl A C del"S X 

TCAACTAC TTTTACT G C TTCTCATTACTTCC-CAAATACATTTATTTTCT Majority 
10610 10620 10630 • 10640 

J!?? 4 ! TCAACTACTTTTACTGCTTCTCATTACTTCC-CAAATACATTTATTTTCT 2603 al2 sea 
]° 5 A 48 C £A C T A SLEJL T _l A l C T G _ C T Y C T C A TTAC T T C C-CAA A T A C A T THAT TT T ISrlif aifsea 
0445 jSr^???^! llT,B!TT,llC A|BBBlTBcnEm|ATlM|AnT||A^nH?c? 5lTl\fl^ 

10445 tcaactacttttactgcttctcattacttcc-caaatacat tta tttt c t oihTn ^tr 1 ™ 

10485 TCAACTACTTTT'A CTGCTTCTCATTACTTC C - C0 AATACATTTATTTTCT tltl^l^ 

TTGATACATT GTTGTC. TCTATCATGGAAACGTATTAATTATAATGTAGTT Majority 
• 10660 IP?™ 10680 , 106 90 10700 

5°? 97 TJGATACAT.TGTT.GTCTCTATCATGGAAACGTATTAATTATAATGTAGTt 2603_ai2 sea 
10597 T tCAT ACAT T G T T G T, 0 T C T A T.C ATG G A A, A C 6 T ATTA ATTATA ATGT AGTT It^tl ^HL 

1 JW? 5? 1 1 ?9f T «™™ T G ab abb T a t T ab xbdd I abbb Aim S seq 

^TCATACAT TGT T G T C T C T A TCATG GAAACGTATTAA TTATA aTcI? A^^T clblll al2 sea 
.10534 TTG ATACAT -T B T TGTCTCTATC A' T G G' A- A A CGTATTA ATT ATA ATGT AGTT ^eb^al^lseq 

■CTCCCTTACT C ATCC CTTT.TCAC TGA.A CTAC A A AAT AT ATTTATCTA Majority 

10 l l ° 10720 10730 10740 10750 

lost G^CCCT^ Ari' ' ^ A jCCCTTTTCACTGAACTACAAAATATATTTATCTA 2603.ai2.seq 

S B Tin cllAr Tnm m . m t »b^ A £ T _ G _^ A JLI A J& A ^ A £ T ^ T A i T T A T A is^i.aiileq 

\f 5 f 4 ■ ™ !!« I 11 ™ ™ cB-tubbi ad Tmmn A n ah ab tb t ab t t abb t a 515^2 . SGq 

10544 CTCCCTTACT - - C ATC C C T T TT C AC T G A A CT AC A A A AT A T AT TT A T C T A clblll al2 ^ 
10584 CTCCCTTACT - - C aQ CCCTTTTC aQ TGAACTACAAAATATATTTATCTA Kell^ls^ 

TGATACATTACAATATTGT GTCAA.AAATCTTATTATAGATTCCTTTAAAA Majority 
10 7 6Q , 10770 10780 , 1Q79Q 10800 

10694 TGATACATTACAATATTGTGTCAAAAATGTTATTATAGATTCCTTTAAAA 2603 ai2 s M 
10694 TGA T AC A T T ACA AT A T T G T GTC A AA A ATGTTATTAT AGATT TTT S£ lifter, 

rm^J YG - A I AGA IT A ^ AAT ATTCTGTCAAAAATGTTATTATAGATTCCTTTAAAA cjbltl ai2^seq 
10631 TGATACATTACAATATTGTGTCAAAAATGTTATTATACATTCCTTTAAAA h36b^T 2 . s ^ 
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t t a a t jg n 1 £g £EB lS45g BEi El att tttcttcc-- r i ^^z 

[Figure: multiple sequence alignment sheet, ruler positions 10810 to 11250. Rows are labelled Majority, 2603, 18rs21, 515, cjb111 and h36b; the aligned bases are not reliably legible in this scan.]
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U260 



[Figure: multiple sequence alignment sheet, ruler positions 11270 to 11700. Rows are labelled Majority, 2603, 18rs21, 515, cjb111 and h36b; the rows are broken across lines and the aligned bases are not reliably legible in this scan.]
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11710 



[Figure: multiple sequence alignment sheet, ruler positions 11720 to 12150. Rows are labelled Majority, 2603, 18rs21, 515, cjb111 and h36b; the rows are broken across lines and the aligned bases are not reliably legible in this scan.]
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^^^°^dZ 6/078m ^^^ w ^^^^ ^PCT/US2005/027239 

[Figure: multiple sequence alignment sheet, ruler positions about 12170 to 12600. Rows are labelled Majority, 2603, 18rs21, 515, cjb111 and h36b; the aligned bases are not reliably legible in this scan.]
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Ji Ml^l * 
Alignment Report: method with Weighted residue weight table. WO 2006/078318, PCT/US2005/027239

Thursday, July 29

[Figure: multiple sequence alignment sheet, ruler positions 12610 to 13050. Rows are labelled Majority, 2603, 18rs21, 515, cjb111 and h36b; the aligned bases are not reliably legible in this scan.]
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S^SSS^!^ *PCT7US2005/027239 

[Figure: multiple sequence alignment sheet, ruler positions 13060 to 13540. Rows are labelled Majority, 2603, 18rs21, 515, cjb111 and h36b; the aligned bases are not reliably legible in this scan.]

FIGURE 19AC
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_ _ 13530 

13270 C TUmiTDcD IK A A xll A HD a a r a n tR a r«i T «£ T £_ T £ A _ T _ T _ A _ T A A IS^LalHeq 

I32S6 CI?T^TT T m M 7 1 a r A « A tBE CBB tHDI ifl aBHHI A A SlS_al2.se,, 

.3296 CTTTATTTTCTGTTAAATAATCAACACTTArc cJM , l_alT seq 

u A b 1 >AAATAATCAACAGTTAGGTGCCCCTCTTCATTATAA h36b_al2.seq 

TCCCCTACTAATTCAGAAATTTCTTCCTTATT T TTCCAGCCATTATCTAr. teJorlty 
. 1? ? 6Q J" - 13580 ,3590 ,3600 

as:;::?:;;..;;::::::;; ; ; » j!f!s «?!™??f £Sir 



GATGTAGATATCGCTTA 



CTTCAGGA- TAAATT 



13610 



13620 



13630 



GCTCGAATCTTCTC Majority 
13640 " 13650 



SS GAT__ TACA tB T G C n T T ACTTGAGGA I A A A T T C C T C C A A T C T-T C T C 2603_ai2.se, 

13368 dobb th c A&znnm ram™ g Aim ammmmi . Ii S 5 t __S a a >_4£ t __.___ i8 ^'-*»^«. 

133S6 6aTItaC*TaH?7tT^? AOTAl_B______JO_0A T T G C T_U A A_BB T____f 515_al2._eq 

. 13396 GATGTAGAT A- TGGCT-T ACTTGAGGA - - - - - TAAATTGCTCGAATGTTCTG MGb^aT^seq 6 ^ 



13503 
13504 
13418 
13401 
13441. 



ATCTAAGCGTTCAATATTCGGGTTAAAGGTGACA A TACCCCCTAAATATT Majority 
13660 13670 13680 13690 *" ^00 



A T C T A A G C G I I r A A I a H 5 S 7! T T A A A G G * G A C A A T "A C C C C C T A A A T A T T 2603_3 12 sea 

m nti&i AiMiliSiU Ulil_c c i i_ >™ ™, ;___, S?2__r 



TCATGTTCTATGCTCTTTTCTAA AATCTCTAAATAACTCAATGAC 

*3720 13730 13740 



13710 



- - Majority 
13750 



KnAHJHWrsiUJKSJili.TSJJiiiKiis^iiKifl^aA^ 



13598 
13599 - - 
13518 
13496 
13536 - 



•TCCTCC'TTTCGTTATAAAAACCATACCCACAT ACATACTTACTfirT „ aJority 
13 760 13770 l37 80 t3 790 



— . _ 13800 

T GGTGCTTTGG T'T .3^^^^ A p GA Z! A ^^'^' A CATAGATAGTTACTGCT 2603_al2.sea. 
i u o i u c 1 T T G G T TA T A A A A A C G ATAC. CGACATAGATAGTTA C T c c t t«~->i 

T g ct 5f ? ? ? « £™ T A A A A A ™ A ? a«dbab_tBcb?__^ T gdo _£2___r' 

ibOHCTT.TCCTTATAAAAACCATACCGACATAGATAGTTACTCCT ^thui 

tggtgctttggttataaaaacgataccgacatagatagttac ?gc£ _£__£__T 



A C T A A A C TTT C A A T 



CACATAATTTAC 



13810 



13644 A. CTAAAGT T f G A A T - 

13645 ACT A A A C T T XjS.'A A T . - 
13567 HI] AD A|B.tfin A A TQ 

13542 ACT AAA C \T TtG A AT 
13582 AC T A A A C TTT G A A T :- 



13820 
_i_ 



CAA. TG ATACTGACATTTGAG Majority 

13830 13840 13850 

_t_ 



TATTG ATATAATAGAGTA CA6CTCCACTAAGA 
13860 13870 13880 



G A C A T A A T T T A C C A A T G A T A C t G A C A T T T G A G 2603 ai2.seq 
H A _£ A !__£ T AC C A A T G A T ACT&ACA T. TT.G AC ISrsSl al_Ts€* 

5-^ S A I A A T ^ C A A -T G A T A C T G A G A T T T G A C oJb7tl_al2Vseq- 

- - GAC ATAATTT ACCA ATGATACTGAdA'TTTGAG ifeU^^ 



GTAGC AGCAATTAAATA Majority 



13690 
13691 
13617 
13588 
13628 



13739 
13740 
i3665 
13637 
.13677 



13890 



tIBBt m m n mm inn ?± c JLi?J. A „ c A - c l1 ag _ c _ a g c a a t t a a a t a 'u^u^Lt 

™ T CHAT A A a^S ^???? ,,BI A|B G aBcBa_«Bc C A A T T A A A_H SI5_a 1 2. seq ^ 

TATTr«T.J AA i.B A ^ ACAC ' CTCCACTA » GA - GTAGCACC *»' T ** ATA ilblll al2 seq 
TATTGATATAATAHagTACAGCTCCACTAACA-GTAGCACCAATTAAAT a' h36b_aT2.seT^ 

CCGCAGCATTCCTCTTGTTAAfTCTTfAAAA GTAAATACATCTCTTAAAr. Majority 
13910 13920 13930 13940 TaSSO 



■ ■ t t , — ~ motiau 

G C G C A CCA?^rr^^ GTTAATTCTTTAAAAGTAAATACATCTCT TAAAG 2603 a I2 sea 

A B £ t ^_MWBH T °^ T A JLL T c T T T A A A A G T A A A T A C A T G T CTT A A A C 18rsil__ 2IL 

sasa ah c. a T Bmaaaa i«Bt aeht ch inmi c tesmki a c a_j c tmoh aaQc liTlu JL ^ 

Gcrr. C r A n^ C I^ TTGTTAATTCTTTAAAAG tT^TT A C A T C tTT? A A A C cJoTn atf sea 
GCGCAGCATTCCT.CTTGTTAATTCTTTAAAAGTAAATACATCTCTTAAAC Sib ai2 ^ 
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^f 80 J3990 14000 



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13789 
[Figure: multiple sequence alignment sheet, ruler positions 13970 to 14260, ending the alignment. Rows are labelled Majority, 2603, 18rs21, 515, cjb111 and h36b; the aligned bases are not reliably legible in this scan. The sheet closes with a "Decoration" legend, whose text is illegible, and a list of the five sequence files.]
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CT/US2005/027239 

[Figure: pairwise sequence alignment sheet, ruler positions up to 1300. Rows are labelled Majority, 2603 and nem316; row start positions are spilled into separate columns and the aligned bases are not reliably legible in this scan.]
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T T G T T l ATA'fc C tTH fgFgT Fg_ &£ H ?A AATGTTCTGATATCAAAGTTA Majority 

1310 1320 1330 1340 1350 

1202 TTGTT A AAAGCTAATTGCCTACACCAATA AATGTTCTGATATCAAAGTTA 2603_ai2.seq 
1301 TTGTTAAAAGGTAATTGCCTACACCAATAAATGTTCTGATATCAAAGTTA oe_3l6_al2.seq 

GCAAATATAGCATACAAAGGAATCGCAAAGACATACTTGACAGCTACCAT Majority 
1360 1370 1380 1390 1400 



12S2 



GCAAATATAGCATACAAAGGAATCGCAAAGACATAGTTGAGAGCTACCAT 2603_aI2.s 



1351 GCAAATATAGCATACAAAGGAATCGCAAAGACATAGTTGAGAGCTACCAT ne_3l6_ai2_seq 

AGATACGGTCAAGCTAACTCTACCAAATACACTAGCTTTAATAAAATCTT Majority 
1410 1420 1430 1440 1450 



1302 A G A T A C G GTCAAGCTA ACTGTACC AAATACACTACCTTTA ATAAAATCTT 2603_ai2.seq 

1401 AC AT A CH GTCAAGCTAACTGTACCAAATAGACTAGCTTTAATAAAATCTT nem3l6_.al2.seq 

TTGCACTCTCTCTATTTTTCCACAAAATAGCGAAACTTCC TAAAAATAAA Majority 
1460 1470 1480 1490 1500 

• 1 : 1 j— i 

1352 TTGCACTCTCTCTATTTTTCCAGAAAATAGCGAAACTTGCTAAAAATAAA 2603„ai2.seq 

1451 T'T GCACTCTCTCTATTTTTCCAGAAAATAGCGAAACTTGC TAAAAATAAA nem316_al2.seq 

GCTACAGCAACCATATTCATCCCTAAACCGATAAAGGTTTCTGGACCACG Majority 
' • 1510 1520 1530 1540 1550 

1402 CC TACACC AACCAT AT T CA TCGGTAAAC C_j A T A A A{_ GTTTCTG C AC C ACC 2603_al2.seq 

1501 •CCTAGACCAACCATATTCATCCGTAAACCGATAAAGGTTTCTGCACCA.ee ne_316_al2.seq 

ATTAGCAAGTATAACTTTTAAAAGTGATCTTAATAAGAGTACACCATAAC Majority
1560 1570 1580 1590 1600

1452 ATTAGCAAGTATAACTTTTAAAAGTGATCTTAATAAGAGTACACCATAAC 2603_ai2.seq

1551 ATTAGCAAGTATAACTTTTAAAAGTGATCTTAATAAGAGTACACCATAAC nem316_ai2.seq

TTGATTTCAAATCAAATAAAATAAAAGCAACTAACATCGCAACGATTGAA Majority

1610 1620 1630 1640 1650

1502 TTGATTTCAAATCAAATAAAATAAAAGGAACTAACATCCGAAGGATTGAA 2603_ai2.seq

1601 TTGATTTCAAATCAAATAAAATAAAACCAACTAACATCGGAAGGATTGAA nem316_ai2.seq

AAATCAACCTTTAAAAATTCTGCTCCTGCTATTAATGGAAATGAAACCAT Majority
1660 1670 1680 1690 1700

1552 AAATCAACCTTTAAAAATTCTCCTCCTGGTATTAATGGAAATGAAACCAT 2603_ai2.seq
1651 AAATCAACCTTTAAAAATTCTCCTCCTGGTATTAATGCAAATGAAACCAT nem316_ai2.seq

CATCAATACAAAAGATAAGCCAGAAAGAATGGCGATTGTCACCATTTTAC Majority

1710 1720 1730 1740 1750

1602 CATCAATACAAAAGATAAGGCAGAAAGAATGGCGATTGTCACCATTTTAC 2603_ai2.seq

1701 CATCAATACAAAAGATAAGGCAGAAAGAATGCCGATTGTCACCATTTTAC nem316_ai2.seq

GTCTATTTCTCATAAAAAAATTCCTCCAATTTAAATAAATTGAAACAAGC Majority

1760 1770 1780 1790 1800

1652 GTGTATTTCTCATAAAAAAATTCCTCCAATTTAAATAAATTGAAAGAACC 2603_ai2.seq
1751 GTGTATTTCTCATAAAAAAATTGCTCCAATTTAAATAAATTCAAAGAAGC nem316_ai2.seq

TCCAAAGCTAAGCGTATGTACGCCAAAAAAACCTTTCTCTTCTCCCATCC Majority

1810 1820 1830 1840 1850

1702 T C C A A A G G t A A G C G T A T G T A C G C C A A A A A A|c CTTTGTCTTCTCCC.ATCC 2603_ai2.seq 
1801 TCCAAAGG TAAG.CGT A|B CT A C'G C G AA A A A A A C C T-TT GT.CT T CTCC C AT CC nem316_ai2.seq 

AGACTTTACTGTCGGTTGTGGAATCTCACCACATCACCTTTCCCTCGCGG Majority

1860 1870 1880 1890 1900

1751 AGACTTTACTCTCGGTTGTCCAATCTCACCACATCAGCTTTCGCTCGCGC 2603_ai2.seq
1851 AGACTTTACTGTCGGTTGTGGAATCTCACCACATCAGCTTTCGCTCGCGG nem316_ai2.seq

ACTCATCCTTCACAACTCACAAATAACTTCCAACCCATTACCGCCGCTCG Majority

1910 1920 1930 1940 1950

1801 ACTGATGCTTCACAACTGACAAATAAGTTGGAAGCGATTACCGCCGGTCG 2603_ai2.seq
1901 AC TGATGCT.TCAC A ACTG AC AAATAAGTTGGA AfiC'G ATTACCGGCG GTCG nem316_ai2.seq 
G G A A T If X'lC'AlicC cW-jj jfcgj 6' A jgc 1 «)••» C.fc'ffA TACCATAACAAAAAAAACTTG Majority

1960 1970 1980 1990 2000

1851 CGAATTACACCCTCCCCTGAAGACACCTATAGCATAACAAAAAAAACTTG 2603_ai2.seq
1951 GGAATTACACCCTGCCCTGAAGACACCTATAGCATAACAAAAAAAACTTG nem316_ai2.seq

CAATTGCAAGTTTTTTAATTACTAATTAGTAGTAGTGATTAAAAATCATA Majority

2010 2020 2030 2040 2050

1901 CAATTGCAACTTTTTTAATTACTAATTACTAGTACTGATTAAAAATCATA 2603_ai2.seq
2001 CAATTGCAAGTTTTTTAATTACTAATTACTAGTAGTGATTAAAAATCATA nem316_ai2.seq

TTAATACGAAATTACTATCCTGTATCGTTTCTTTCAGATTTGCTATTTTT Majority
2060 2070 2080 2090 2100

1951 TTA AT ACCAAATTACTA tB CTCT ATCGTTTCTTTC AG ATTTCCTATTTTT 2603_ai2.seq
2051 TTAATACCAAATTACTATGCTGTATCGTTTCTTTCAGATTTGCTATTTTT nem316_ai2.seq

AGTTTTTCTTAAAAACATAAACAAAATTCCCAAAATAATACAACCAAGAA Majority

2110 2120 2130 2140 2150

2001 AGTTTTTCTTAAAAAGATAAACAAAATTCCCAAAATAATACAACCAAGAA 2603_ai2.seq
2101 AGTTTTTCTTAAAAAGATAAACAAAATTCCCAAAATAATACAACCAAGAA nem316_ai2.seq

TTCTCAGTCCTCCACCAATAATCATTCCTGTTTTAGGAAGAAATGATTGT Majority
2160 2170 2180 2190 2200

2051 TTGTCAGTCCTCCACCAATAATCATTCCTGTTTTAGGAAGAAATGATTGT 2603_ai2.seq
2151 TTGTCAGTCCTCCACCAATAATCATTCCTGTTTTAGGAAGAAATGATTGT nem316_ai2.seq

GGAAAAAGCGGTTGTGATGGTTTAGGATTTCTTGGTGGACGACTTTCTTT Majority

2210 2220 2230 2240 2250

2101 GGAAAAAGCGGTTGTGATGGTTTAGGATTTGTTGGTGGAGGAGTTTCTTT 2603_ai2.seq
2201 GGAAAAAGCGGTTGTGATGGTTTAGGATTTGTTGGTGGAGGAGTTTCTTT nem316_ai2.seq

TTGGTTTTCTACCTCTACTTCCTGTGTTTTATTACCAACTACAGCAACTA Majority 

2260 2270 2280 2290 2300

2151 TTCGTTTTCTACCTCTACTTCCTGTGTTTTATTAGCAACTACAGCAACTA 2603_ai2.seq
2251 TTCGTTTTCTACCTCTACTTCCTGTGTTTTATTAGCAACTACAGCAACTA nem316_ai2.seq

CAGCATCCTTCATAGATATACGGTAACCAGTTAGTGCTTTTGCTTCTCGA Majority
2310 2320 2330 2340 2350

2201 CAGCATCCTTCATAGATATACGGTAACCAGTTAGTGCTTTTGCTTCTCGA 2603_ai2.seq
2301 CAGCATCCTTCATAGATATACGGTAACCAGTTAGTGCTTTTGCTTCTCGA nem316_ai2.seq

AAAATATACTTACCAGGTAATAAACCTTCAACCTCAATTTCTCCCTTATC Majority 

2360 2370 2380 2390 2400

2251 AAAATATACTTACCAGGTAATAAACCTTCAACCTCAATTTCTCCCTTATC 2603_ai2.seq 
2351 AAAATATACTTACCAGGTAATAAACCTTCAACCTCAATTTCTCCCTTATC nem316_ai2.seq 

ATCAGTTACTAATGAAGTAATCCCATCTTGATCGGTCGTAAATCGTCCAT Majority
2410 2420 2430 2440 2450

2301 ATCAGTTACTAATGAAGTAATCCCATCTTGATCGGTCGTAAATCGTCCAT 2603_ai2.seq

2401 A T C A GT TACT AAT.GA ACT A A. TC ATCTTGAT C*G G T G GT AAATCGTCC AT nem316_ai2.seq

TTTTAAAGCGAACTGGCTGATTCTGGTTATCGTATAATACAAATATTACT Majority

2460 2470 2480 2490 2500

2351 TTTTAAAGCGAAGTGGCTGATTCTGGTTATCGTATAATACAAATATTACT 2603_ai2.seq
2451 TTTTAAAGCGAACTGGCTGATTCTGGTTATCGTATAATACAAATATTACT nem316_ai2.seq

CCTGATAGCCTTTTCTTTATCTTTCCTTCTTTTGTATATTTAATAAGTTT Majority

2510 2520 2530 2540 2550

2401 C CH GATAGCCTTTTCTTTATCTTTCCTTC T.T TT'GTATAT-TTAATAA GTTT 2603_ai2.seq
2501 CCTGATAGCCTTTTCTTTATCTTTCCTTCTTTTGTATATTTAATAAGTTT nem316_ai2.seq

TAATCGGCCTGTTTCAACTTTTCGCTTAGGATTTATCTGTAATTGATTTG Majority

2560 2570 2580 2590 2600

2451 TAATCGGCCTGTTTCAACTTTTCGCTTAGGATTTATCTGTAATTGATTTG 2603_ai2.seq
2551 TAATCGGCCTGTTTCAACTTTTCGCTTAGGATTTATCTGTAATTGATTTG nem316_ai2.seq
Alignment Report of AI-2 alignment, using J. Hein method with Weighted residue weight table. Page 5
Thursday, July 29, 2004

ATAACTTATCATCTGCTATTTCAATATAAAAAGGTACTATTGTTCAAACG Majority

2610 2620 2630 2640 2650

2501 ATAACTTATCATCTGGT AfifT TC A AT AT A A A A ACCT ACT ATTG TTG A A A C C 2603_ai2.seq
2601 ATAACTTATCATCTGGTATTTCAATATAAAAACGTACTATTGTTGAAACC nem316_ai2.seq

CTTTGATCAGCTTTATAAGCACGACCAAAGTACGAACCATTTGGGAGTGC Majority 

2660 2670 2680 2690 2700

2551 CTTTCATCAGCTTTATAAGCACGACCAAAGTACGAACCATTTGGGACTGC 2603_ai2.seq
2651 CTTTGATCAGCTTTATAAGCACGACCAAAGTACGAACCATTTGCGAGTGC nem316_ai2.seq

TATCTTTGTCTGACCATTACTATCACTACCACAAGTCAACATACTCTTAT Majority

2710 2720 2730 2740 2750

2601 TATCTTTGTCTGACCATTAGTATCAGTAGGAGAAGTCAAGATACTCTTAT 2603_ai2.seq
2701 TATCTTTGTCTCACCATTAGTATCAGTAGGAGAAGTCAAGATACTCTTAT nem316_ai2.seq

ACTTCTGCTTCAATTCGCTATCTGTCATTTGGCTCAATAAATCAACTTTT Majority

2760 2770 2780 2790 2800

2651 ACTTCTGGTTCAATTCGCTATCTGTCATTTGGCTCAATAAATCAACTTTT 2603_ai2.seq
2751 ACTTCTGGTTCAATTCGCTATCTGTCATTTGGCTCAATAAATCAACTTTT nem316_ai2.seq

AAGTTGTCAGTCACAGTCCATAAACGATAACAAATCCCCTCCTCTGTAGT Majority

2810 2820 2830 2840 2850

2701 AAGTTGTCAGTCACAGTCCATAAACGATAAGAAATCCCCTCCTCTGTAGT 2603_ai2.seq
2801 AAGTTGTCAGTCACAGTCCATAAACGATAAGAAATCCCCTCCTCTGTAGT nem316_ai2.seq

ATTTGGCTGAAGTCCTATCTGTGTGATTGTTAGTTGATTAGGGGTATCAG Majority

2860 2870 2880 2890 2900

2751 ATTTGGCTGAAGTCCTATCTGTGTGATTGTTAGTTGATTAGGGGTATCAG 2603_ai2.seq
2851 ATTTGGCTGAAGTCCTATCTGTGTGATTGTTAGTTGATTAGGGGTATCAG nem316_ai2.seq

CATTTACACTGGCTACCGAAAAAAACGCTAATTGTACCAATCCTAAAAAC Majority 

2910 2920 2930 2940 2950

2801 CATTTACACTGGCTACCGAAAAAAACGCTAATTGTACCAATCCTAAAAAG 2603_ai2.seq
2901 CATTTACACTGGCTACCGAAAAAAACGCTAATTGTACCAATCCTAAAAAG nem316_ai2.seq

CAACATAGTACAAGTCCTAAACTTTTTCTAATCTTTTTCATTTTTGATTT Majority

2960 2970 2980 2990 3000

2851 CAACATAGTAGAACTCCTAAACTTTTTCTAATCTTTTTCATTTTTGATTT 2603_ai2.seq
2951 CAACATAGTAGAAGTCCTAAACTTTTTCTAATCTTTTTCATTTTTGATTT nem316_ai2.seq

CCCTTTCTTTTTCTCTCTTTAAATTTTCGTTTTAAATATAATAGTAAAGC Majority

3010 3020 3030 3040 3050

2901 C C C T T T C tQt T T C T C T C T.T T A AATTTTCGTTTTAAATATAATAGTAAAGC 2603_ai2.seq
3001 CCCTTTCTTTTTCTCTCTTTAAATTTTCGTTTTAAATATAATACTAAAGC nem316_ai2.seq

GACTAATATAACAATAACTAGGATTCATAAGAGGAAATAAAGTTTATACT Majority 

3060 3070 3080 3090 3100

2951 GACTAATATAAGAATAACTAGGATTGATAAGAGGAAATAAAGTTTATAGT 2603_ai2.seq
3051 GACTAATATAAGAATAACTAGGATTGATAAGAGGAAATAAAGTTTATAGT nem316_ai2.seq

GTGTTTGCAATTGTTTCATTAAATAGTTCTTTTCTTTAACAGGAGCTACA Majority
3110 3120 3130 3140 3150

3001 GTGTTTGCAATTCTTTCATTAAATAGTTCTTTTCTTTAACAGGAGGTACA 2603_ai2.seq
3101 GTGTTTGCAATTCTTTCATTAAATAGTTCTTTTCTTTAACAGGAGGTACA nem316_ai2.seq

TACTTGATTCGATGCCCTCTAACTAGTAAACGATGTGAATTAATCGAATA Majority

3160 3170 3180 3190 3200

3051 TACTTGATTCGATGCCCTCTAACTAGTAAACGATGTGAATTAATCGAATA 2603_ai2.seq
3151 TACTTGATTCGATGCCCTCTAACTAGTAAACGATGTGAATTAATCGAATA nem316_ai2.seq

AGGTGTACATGTTAGCAAAGTCGCATAATCCTTACCTTTAACAACCAATA Majority

3210 3220 3230 3240 3250

3101 AGGTGTACATGTTAGCAAAGTCGCATAATCCTTACCTTTAACAACCAATA 2603_ai2.seq
3201 AGGTGTACATGTTAGCAAAGTCGCATAATCCTTACCTTTAACAACCAATA nem316_ai2.seq
3260 3270 3280 3290 3300

GCTAAAACTTCTTTCATATTATGAATATAAAAAAT T TTTCCTTTTTTA a <i 
3310 3320 3330 3340 3350




TTTATCTAAATCTGTA A . T t A C I T U CTTTU 6TU » c C C C A T n C r t C 
2_L 3380 3390 3^00 

TGATAACAr,TATGTCAACTTTTTCCACCAATTC (; r.,. r, G AGCTTr.CTT f -, ^, orlty 

3410 3420 3430 3440 3450

AGG'TGTCCTGCTCCTTTTT CAAGA A C A CT ACTCGTAiiTr.C-CCGCATflOfl'T 
3470 3480 3490 3500

AGGTAATTTTTCCTTGATA. GACGGTATATCAATATAT Q cAATCATTTf a fi ^ orlly 
3520 3530 3540 3550

TAATCTCAAGCATGTGGCCGTATTCAGCAATACn T TTTTT T - TnTTTTTCA 
3560 3570 3580 3590 3600

GTATACGGATCTGATAGGCCCCTTGCGTCCAC T r.TT C TATTATA Ar.r-rx T ^ orIty 
— ff? 3630 3^0 

TGCTAACTCAAATCCTCTA.TTAATCTCTTTAGTAT T TAATTTTTGGCTtt 
3680 3690 3700

CATTATCAAAGTTAGTTACTTGATTATTAGCTTTA A TATTATAG.T,rr,. h*,^,, 

TTTGAAATAAAAGG A T -T.A T C A G G T T A T € A A A A G A C f:- A A'C T A A' G A A' C A. A t a MaJorit ^ 
3760 3770 3780 3790






T ATCACC CCT AC ATTC ATC C.ATCGATTT A AA Afc G ACCi ; ATTTCTT AA Rr. t Majority J 
3810 3820 3830 3840 3850

TTTTCTGAAA TTTTCCTCCCATTATGATTC A A TTr p xT-TTCTA A C. A rtTc .faj^y 
i 22 fSTO • 3890 39^)0 
c t a a a E§B5| g3BH gS^r a idl Sgr att ucchactaaccaacca Majority

3910 3920 3930 3940 3950

3801 CTA ^CCATTTTTTTGACGTTGACCTTTUTTA ACCAAACTAACCAACrl ^ 

CTAAACCATTTTTTTCACCTTCACCTTTTATTAlm'acnuauci "ai^seo 

ATAATAACTAAACATATATACAATAGAT A TCTATA AATCGTGTTTA AATG Majority 
3960 3970 3980 3990 4000






3851 ATAATAACTAAAGATATATAGAATAGATATCTATAAATC GTGTTTAAATr o fin o q4 -> ^ 
3951 ATAATAACTAAAGATATATAGAAHACATATCTATAA^CGTGTTTAUU SlKiSeq 

ACCTTCTT TTATIAAT TTTTCATCAATAGGACCTTTATAAGGCATACCAT Majority 

4010 4020 4030 4040 4050

A C CgT CTTTTATTAATTTTTCATCAATAGCACCTTT ATAAGGGATACGAT ?etn ,i? ^ 

CTCCCCTTACTAAAAGTCTGTGTCTATT C ATCATAATCGGCGTCCAACTT Majority 
4< y° . W0" 4080 4090 " 4100" 

GTCCCCTTACTAAAAGTGTGTGTGTATTGATCATAATCGGGG TGCAAGTl Si 2603 ai2 sea 
4051 CTCCCCTTA.CTAAAAGTCTGTGTGTATTC aQ CATAATCGCGGTGCAAG TT ne^T6^.aI2^eq 

AATAAGCTTCCATAATCATGTCCAGGAACAACCAACAAATCTGAAAAGTT Majority
4110 4120 4130 4140 4150

4101 AATAAGGTTGCATAATCATGTCCAGGAACAACCAACAAATCTGAAAAGTT nem316_ai2.seq

ATCGGGTGTAACGACTTTTATCTGATCTACTTCATATGCTATCGTTTCTT Majority
4160 4170 4180 4190 4200

ATCGGGTGTAACGACTTTTATCTGATCTACTTGATATGCTATCGTTTCTT 2603_ai2.seq
ATCGGGTGTAACGACTTTTATCTGATGTACTTGATATGCTATCGTTTCTT nem316_ai2.seq

TTATGTTTTGAATATAAAACTTATCTCCTTTTTTTAACTTTTTAAGGTTA Majority
4210 4220 4230 4240 4250

T^TG?T^G A A^ 2603_ai2.seq
TTATGTTTTGAATATAAAACTTATCTCCTTTTTTTAACTTTTTAAGGTTA nem316_ai2.seq

GAAAACACTTCTTTATCTCGAATTCCTGAGTGCGCTGTTATAACGGTATG Majority

4260 4270 4280 4290 4300

4151 CAAAAGAGTTCTTTATCTGGAAT TC C T G A^iT GCGCTCTTATAA c Bg T A T C jusim <^ 
4251 GAAAAGAGTTCTTTATCTGGAATTCCTGAGTGCGCTGTTATAACGGTATG nem316_ai2.seq

TGTGCTATTTCCTCCAATTGCAAGAGAGGTACCTTCTAAATCCCCTGCTC Majority

4310 4320 4330 4340 4350

4201 TG THCT ATTBCCflCC A ATTGGA AG AG AGCtA CCTTCTA A ATGC CC TGCTC 2603 a!2 sen 
4301 TGTGCTATTTCCTCCAATTGGAAGAG AQ GTACCTTCTAAATGCCC TGCTC n^ltllTL, 

CTTTAGATAGAACTTCTTGACTTCAACCTGCAAATATAGGGAGTTTTTGA Majority

4360 4370 4380 4390 4400

4251 C TT T A GAT A GAA C TT C TT.G A CTTGAAC.C T G C AAATATAGGGAGTTTTTf' A ? fi m fll ? ™ 
4351 C T T T.A 6 A T A G A A C T t.C T T G A C T. T G .-A. A CCT> C A A A* A 6 b G A G T T T.'T T G 

C C T A T C TUG G A AC TC A A AT T CT TC C G A TT T TTTCACTTA CC T C T A A C A T Majority 
4410 4420 4430 4440 4450

4391 ^ C T A T C T T AC G A A C T G A A A tT G TT C C G A T T T T T t C A C T T A C C T C T A A C A T 2fi(tt ^ 
4401 CCTATCTTAGGAAC'TGAAATTG t|J| CC'GATTTT-TTCACTTAC CTCT AA ^ AT S^eq 

ACGGGCGTACTCTCCTACCCCCTTTTGAATTCGTTTTTTCTCATAAGGAT Majority

4460 4470 4480 4490 4500

4351 ACGGGCGTACTCTGCTACCCCCTTTTGAATTCGTTTTTTCTCATAAGGAT 2603_ai2.seq

4451 ACGGGCGTACTCTGCTACCCCCTTTTGAATTCGTTTTTTCTCATAACG nem316_ai2.seq

CTTCAAGATGGACATTATTTAAAGAATCATTATAACCTTGTGCTACAGTC Majority
4510 4520 4530 4540 4550

4401 CTTCAAGATGGACATTATTTAAAGAATCATTATAAGCTTGTGCTAGAgBc 2603_ai2.seq
4501 CTTCAAGATGGACATTATTTAAAGAATCATTATAAGCTTGTGCTACAGTC nem316_ai2.seq
ATACcfe EgTO ttSg j^'dS Hj jgj R TTTTTTACCACCTCTCTCAAA Majority

4560 4570 4580 4590 4600

4451 - aI a r rW r 5 A A T T T C TTTC T G A C T A A G TT T TTT A G C A C C T C T C T C A A A 2603_ai2.seq

4551 ATACGBCGATTGATTTCTTTCTGACTAACTTTTTTAGCAGCTCTCTCAAA nem316_ai2.seq

ATCCTCTGTTTGATTATTAGATTCTATCGTATAGTAAAAACGTGATACCA Majority
4610 4620 4630 4640 4650

SI A I^^ T 5 TTTGATTATTA ^ ATTCTATCG ^TAGTAAAaIcGTGATACCA 2603_ai2.seq
4601 ATCCTGTGTTTGATTATTAGATTCTATCGTATAGTAAAAACGTGATACCA nem316_ai2.seq

CTGGATACAATAAAATAGATAGACCTATTAGAAAAAGAATGATAAAAGGA Majority
4660 4670 4680 4690 4700

4651 CTGGATACAATAAAATAGATAGACCTATTAGAAAAAGAATGATAAAAGGA nem316_ai2.seq

AGATTTGACTTCTTCTTTTTTTTTGTTTTTTTGTTGATTTTTTTAGTCTT Majority

4710 4720 4730 4740 4750

4601 ACATTTCACTTCTTCTTTTT TT T T GTTTTTTT gKI TGA*TT. TTTTT-A ctptt 2 603 ai2 sea 
470t AGATTTGACTTCTTCTTTT TQHBfl G. T T T TTTT GT T G A tB TT.TTTAGTCTT Z^ai^eq 

CACGTCATCTCCTAGATAATGGCTCTTGCTTATGATCTAAGAGTACTTCT Majority
4760 4770 4780 4790 4800

4651 CACGTQATCTCCT AG ATAATGGCTCTTGCTTATGAT'CTAAGAGTACTTGT 2603 ai2 seq 
4748 CACCTCATCTCCT a|Ja TAATGGCTCTTCCTTATG ATCTA ACAGTACTTCT neaaTe^aiHeq 

ACTGAAATACCCTTAGATCATAAGCACAGCTTTAACTGTGCTTATACATC Majority

4810 4820 4830 4840 4850

4701 A. C TGAAATACCCTTAGATCATAAGCACAGCTTTAACT gQ GCTTATAC ATC 2603 ai2.seq 
4798 ACTGAAATACCCTTAGATCA T A A G CACAG CT TTA AC T G TGCTTATAC aQc «ea3l6_al2.seq 

ATCAAAGACTAGCCTTAAGCTTCCTTTGATTGGCGTTTTTTCATGATAAC Majority

4860 4870 4880 4890 4900

4751 A T C A A AGACTAGCCTTAAGCTTCCTTTGATTCGCG TflT T XTC A T G A T A A C 2603_ai2.seq 
4848 ATCAAAGACTAGCCTTAAGCTTCC tQ T G ATT G|| C GTTTTTTCATG aQa AC. nemslelai^eq 

TACTGCTCCAACCATAATGCTTAAACCAATAATTCTGAAAAGAATTGTAC Majority
4910 4920 4930 4940 4950

I!"L CTCCAACCATAATGCTTAAACCA ATAATTGTGAAAAGAATTGTAC 2603 ai2.seq 
TACT-GCTCCAAGCATAATGCTTAAACCAATAATTGTGAAAAGAATTGTAC ne^76_ai2.seq 

CAATACCACCTGTTTGTGGGATTGTTACTTTTTTGTTTTGTACTTGTTTC Majority

4960 4970 4980 4990 5000

4851 CAATACCACCTGTTTGTGGGA TTG TTA CQT T T T T&T T T T *T A c|H G TBhjS 2603 ai2.seq 
4948 CAATACCACCTGTTTGTGGGA tQ CTTAC'TT TT TTGTTT T (III A cT?C TTUfc nemsls^l^seq 

CCATCTTTTTTTACAGGTTTTTTGTTATCTGCGTTGTCAGTTTTAGCCCC Majority

5010 5020 5030 5040 5050

4901 CCAT C T I 1 1 iMTH Tv jHBI T G T X aM^SBTc GtHg t'.C a|N| tW T T aIB rKl r r 2603 ai2 sea 
4998 G C A TC T T T T t J^T^^^X^H^j^H^^Tn f B A^tThP^.C G^G T C *K^f ^c"P;^I^a& 

TTTTCTGTA TC A T G XT T G;A T T: T A C T T C A. A AC T T TATA TT -A.'CC T G C C A ATT Majority 

. . . : 5690 ' .5100 

4945 . TDt^HT G X A i G ^llB^TTGATfllA C T||C A A AflT Tfl A T'A TTA CCTGCCA Afli 2603 al2.seq 
.5048 TT Tffic TUT A T.G aTg^tJHSB Affl T T ACTTCAAA'G T TT Afi A tB>A C C T cfic A AtB ne-aie.ail^eq 

TCCCATATCCTGCTCGTCCTTGTCTTTCTTCCACCTTCTAACTGCCTTTT Majority

5110 5120 5130 5140 5150

j995 TH G C AB AOT C C T G C T C GB G CBQ G T G T T T C T T CB A G G T T G T A A G tQ c cHtBi; 2603 ai2.seq 
5098 TCGCA TAfl C CT'GCT'CGTGtTT G.Q GTTTCTTGC AB cfl Tfl TAAGTC.CCT TtB nebsTe^aiHeq 

TCCAGACCTGTAATTTCAAATTCACCTTGGTCCTTTGAGGTGTATTTAAT Majority

5160 5170 5180 5190 5200

5045 fa CflA£A C CTG T A AflT TCAAATTG ACC TT G^T CCTTTC aBgtCT aIMH T aTt 2603_jai2.seq 
5148 TCCABAC CBC T A A T T T C A A A T T G A C.cnHBfcB| CGTT|Jg A ^HtSSB A T T T A Afl neffl3l6_ai2^eq 



4801 
5095 CG CT
5198 BB C 



523Q 5 2 4 0 

[TTATCTC TT ATCC AQ T CAT aBc C T"B T 



A G CH T 2603__aI2.seq 
GTTAGCTT nem3 I6_al2_seq 



aQ CCATTCATATG 

TAATGAAGCCTTCATTGTAATCTTTTTGTTTTTTTTTGATACTTTCTTTT Majority

5260 5270 5280 5290 5300

5139 B A A - T _°- A A G GClflc ATHGTAATC tWWt T CT TtW I Ittt t r. A I aIWt t rTTt ™, _ 

GCTGTTTTTCCTTTTTGGCCTTTTTGTTTTTTTTTACTCAAGTTGTTATA Majority
5310 5320 5330 5340 5350

5189 GiHg tBBt T C C T TTT tBB CCTTTTT gBtHtHt tBB A g||c A aBt T G T T A T A 2603 ai2 sea 
5298 G C.T G T T T xH G G T T||TT G G C C T t||t T G?t"t^™a G T C A AG tQ G T TAT* ne^T^ail^eq 

AGCTTTGATAGCTTCATCTAATGCTATTTTAGCAGTTTTTGCAGCTGCCT Majority
5360 5370 5380 5390 5400

I G AT A G C T T C A T C T A aQg C T A T T T t| 



5239 A G CbHOG ATAGCTTCATCTA A||G C T A T T T f^^k*>4M TlKl 

5348 ACC TT tB AH A G C T T C A T c|] A ATGCT aIJt TTAGCAGTTTTT 



5400 





I7d 







2603„al2.seq 

"MU 1 * 1 a b u a i» 11 1 1 i|] UAGCTGGCT nem3l6_ai2.se 
TTTTTTGACCTTTTGCTTCATCTCCTTTACGTGCCAGGTATTTTCCTTCT Majority



5410 

5283 T T T T TTG Aflj C t T 
5398 T TED T TH A G C ~ - 



5420 5430 5440 5450 

ITGBtTGATCTGC TTTaAgIIgcHAG GTATTtHcCtBcT 2603 al2 sea 
' T C C TEH aB CTGCTTTACGTGCCABgTATTTTCCTTCT nein3T6_ai2^e q 



GAGTTTTTCACAAGGAATTTTGCTCCTCCCAACCCTTCAGTTTTAGCTTG Majority
5460 5470 5480 5490 5500

f?? ^AG T T T TTCACAAHgA A iJlT G CQ C cfl G C C A aBTcTB T C AlillT ifflH l l|| 2603 a!2 sea 
5448 Bll tBttBA CAAGGA aBBRH GCTCCTGCCA AG C GT T C aT$BtTaTTt T " ne^Te.ai^eq 

ATTTGTTTTCACAAATTTCTTTCCATGACTTTCCACTTTTCGTTCTCTTC Majority



5380 
5495 



A T T|_ 

ATTTGT 



5510 5520 5530 5540 5550

'H'CTT t TJ_ A C A A A TT T C T TflCC A T GACt TTCBa C T T T T C r. t t rW c t t n 2 603 ai2 s, 
ITGTTT TH AC AAATT TfiBBfT CCA TH A G THB CCA CTTTTGGTT C T cfl T G ne^alf 



5550 
i_ 

GT^TG 2603„ai2,seq 

.seq 



GGTTGATTGGTGTTGGGTTATTTGAGTTTTTGTTGTTGTTGATGGTTACT Majority
5560 5570 5580 5590 5600 

5430 CGTTiAlTCCTCTTCCCTT A t|_B G aBtQt TT gB'tBt T G_8 tH ATGGTTAf T wan a ,7 ^„ 
5545 G G T T C A T T G GH G T T G gB TH A T tr" G A G T T T T T GT T- G T ?§f?? G A t£g tQ A C_| 

TTACCATTTTTTAGTTTTATTTCTTTTGGAGTGTAGCCGGTAACACGTTC Majority
5610 5620 5630 5640 5650

„AACACGTTC 2603_al2.seq 
TA AC AC GT T C nem316_aI2.seq 



5610 

5480 TT .ACCATTT tI_ 
5595 ID A C C A T T T T T| 



5630 

1 1 tIABBQTt~a Q C Cb» 

I T G G AG f GT'AGC C 6| 



:GAT GCTGTATGTTTTGGTATTTTCTAGACCTGTGAATTTACCCGAGT Majority 

5700 

G& 2603_jal2/seq 



5670 



St.G T A .T-(?Bt TT 



5680 
- 1 - 



A T T TTC C A G .G TGTTTTC GTJ T T 
5710 , 5720 

ItHHt T 



i T T__/ T V JL A__A G Cfl G T G A A X f -t A C . C C ,G . A GB 

I'A'H A c c t g t g a kWEBMBBKUL* g't nein3i6._ai2.seq 




*tfpnc t aBg t TT tbIs't t n t < 

r T G T -T'-C C T T T TtGTCACGT C G C ft t T AC C T Majority 
i - c7ia ' . 5740* . *• 5750' 



^»Jt T T GQtB Cl 

tCGt'TTTGTTGC 



.5730 



S_t_r tDBBIa g GTCJ 
J t e t c ABaf 



- *«2 A C CT 2603_ai2 . seq 
. -— ' - ~ r * * " ~ * »™ui » « * w aw »UUU ^ia T TQ A C CH he_3I6_al_.seq 

A T G T TT TTGTTTG TTTTTCTtGCCCCGTAGTTTT AT C T T T A T.C TTT Majority 
. . fj*g S770 5780 5790 5800 



5770 

5624 T. tHH a T G T T T T T G t WIW Tfllj | c tLwk 
5737 HTG T aOBBBtDBg T T TGTTTTTCTTGC 

■ C.AATGTGTAAGCAACGTT TACATCATTAGTTGTACTTTCTTCTGCCCAGT Majority 
5810 S820 5830 5840 585^0 



■ J — : — , : i_ 

^_A GTTTTATCTTTATC TTT 2603_lai2.seq 
G G T AIBBt AfflcTTTABc tBB nem3l6„al2.seq 



5830 

1C A A C G T T T AC AQ C A Tfl A GM T GQ ABT JH ( H 
|G C.A A CUT tB(a CATC a|}t A_J T T G T A CGHt C Ti 



T cQgDc C AE_B 2603_ai2.seq 
CTGCCCAGT nem3i6_al2.seq 
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5860 



5870 



5724 



TT T T A CBBA CtB-TT AflT TC 



5880 



. * . » A ^ 'iniiAui r cum t T T gHH g g t g t a'B cfem gIW r; t mfl 

5824 BTTT AlfrT A C TT T fl.A TTTCTTGGTT|CCTG gB G T ACCTt|S(tT G tBgTMII 



5890 



^JT C 2€03__ai2.seq 

1 nem3i6_ai2.seq 



CTTTCTTTCGTTGGTTTGTTACCATACTCCAATTTGATATCATTGCTTTC Majority

5930 5940 5950 



5910 



5774 I M HlCTT 
5874 C T T T C T Tl 



Be G T 



G G T T T G T T A C C A T A HUM C AATTTG ATATCATTGg||tTC 2603 alZ seq 
TCgBttBtTACCATAGTCCAATTtBaHaTCATTGgTttC nentfle'a 1 2^eq 



TGGATTTTCAATAGTTGTTTGGCCGTTCACCGTAGCTGTGTAAGTGATTT Majority



5824 T G G A T T| 
5924 TCG Af 



5960 5970 

— . T C A A T AHT T G fl T T C 
T T C A aQa G t|] G t BBH 



5980 



He cQttQa cQ ctag Ci 

GCCGTTCACCGTAGC TG| 



6000 

TjfiTA ACtBa|HtJ4[ 

GTAAGTG A THT 



2603_aI2.seq 
nem3 I 6_a i 2 _ seq 



5974 



5924 
6024 



TGATTTCAATATCTTTTTTTTTTGCTGCTTTTTCTACTTTTCCCAGTCCT Majority
6010 6020 6030 6040 6050

Q^TTTHg "GCT^TTCHABTTjjm^ 2603.JU2.seq 
In Till TTT*r c r W\ c rWWVWm -rfffl n t * r> T-Wmn -r> ^ ~M . TTlf* ~ 1 



5874 Tl|Af]T TCAATATC 



TGATTTCA AB A T CTT 



|C T A C tBITG cBagBc C T nem3i6_at2.seq 



TTTCCTTTGAATTTTACTCTCAAACCTTCGTCATCTGTTCTCAGTTTCTA Majority
6060 6070 6080 6090 6100

IctI 



"tHB g ctBFH* 
Bit g cflT tB 



A A TTTT A 
A AimQS A G 



lij I C A A A C cBMffcTclB C A T CtW Tl 

G AAACCTTGGTC ATCTG t T| 



A G T tDBt A 2603.jai2.seq 
GAGTTTGTA nein3l6_ai2.seq 



GTTTGTATCCTTAGGAAAATTTTTTGTATTTCCTGTTGCTTTAAGGTTGT Majority



6110 6120 

5974 G.TllT CTATC CTT AGllAAA aAt.T fl 
6074 CTT TEH aB cflBB A G G A A A A tB T T 



It g t i 



Ia t 



6130 6140 6150 

B T T C C T gBB G C T T T ATG~GTffTl»!f 2603_al2.seq 
■MIT G T TOG T T T A ABC T TGT nea*316_ai2.seq 



6024 
6121 



TAGTTGAACCCATTGTCAAACCTTTTGTCATTCTATCTGTCCAAACCACT Majority

6160 6170 6180 6190 6200

: ^CCC ATTG i C A A A I ilJYVl &PT A Tt|B T A T CYg TCCAAACC 2603_ai2.seq 



Da gttg 
tHg t t g 



A A C| 



|G T C A A A C C T T TBG T C A tDTC T A T C 



I G T C C A A A C C A G T nem316_al2.seq 



T TJT tB|G T A T.T Tgc ABC CTTTG'TGA ATTTTTG.TT.T.T A A f!TT 
6165 T tQ TIjB t aHtBTt G A G C C T T Tflfl P A A T T T T T G-T iQB A A C tB 



6124 



ITTTACCGATTT 



TTTTTCTATTTTGAGCCTTTGTGAATTTTTGTTTTAACTTTGTATTGTTT Majority
6210 6220 6230 . 6240 6250 

It aO&GIUMM 2603_ai2 .seq 

I T G T A T Tfl T T T nem316_ai2.seq 

TTTTTTACCGATTTTCGCTGTGGCAGTTGCTTTGTCACGTGCATAATTAC Majority
6260 6270 6280 6290 6300 

fir C A CTT G CTTTCTC A.CC f Q GAT A A T tTc" 2603.jai2.seq 
«C ADHD C T T T G T cBBBBBBBHBBBBBBi nea316.jai2.seq 

CATAATTTCCGCCAGCTGTCAAAACTCTATTTTCCTTTGTCAGTGTTGTC Majority

6310 6320 6330 6340 6350

•^IMXl.^G CG.CC.AC C T CTC A A A A C t C TA T iIWcMtMt n T C. aKIt r. W t r. Fg 2603 ai2 seq 
6251 '■^^^^^^■^^^■■■■■■■■(■■Bi T GW:TG TC AG fn^BHt C ne^is^izj^ 

A A GTTGTTT G TTT T T G C A A A C T .T T T t A T C T ; A T TT G T G- G T.T T T T- fvlf CA'GT Majority 
6360 6^70 6380* . .6390 . 6400 



6215 T T tfltD A C GB aBtDFtC G C f gBTgI 



.6224 



t ^^S^D I I ^ H G C ^ A AGtTTTTAT gAaTT Tfl T G G T T T T^Ht T C A G T 2603 ai2.seq 

A A CBlTB T T TH tQB*B T G C A A A G T tB T T A T C T AH T T G T ft G T T T rltf? «r x r A is -r ..f?.. 



* " V T * f m ■ • * * . * ^ x l ivLH l 1 U-A-G T 2603 ai2.seq 

hrCCAAAGT TBT T A T C TAOt T G T G G T TTTpT T C A G T nem3l6_al2Tseq 

GTTCTTTGGATAAATGTGGGCATCTTTAACAACACCTTGTTTGTTTACCA Majority

6410 6420 6430 6440 6450

6274 GT T C T T T G G A T A A AflQ T G G G C A T cIBBIa A.CAACAC cH tB T llWI T T T A C C A ?finq „I7 
"6320 ; B TTCTTTG. GATAAATGt gQ G C A.T C^^^ AACAAC AC C T T^H T^^ TTTACCA neSlCi^eq 



ATCGCAGAGTGATTTTAACTGGAACTCCTTTTCAATCACGCAGCATGGAA Majority

6460 6470 6480 6490 6500

1 A G ACTC ATBTT A ACTGG A ACQGCT WTG A Afic AG CC AGG aB_GG A A 2603 ai2.seq 



6370 ATGCCACACTGATTTTAACTGGAACTGCT-rTTCAATCAGCCAHcAtkjGAA ne^ie^l^ 



FIGURE 20I



WO 2006/078318 PCT/US2005/027239

C C I T T g^rjf^Bgj f"fiT.fe.,X cfr' g F ^ C T T P fl ACGATTTTCTACTT U a ,^,^ ' 

^ 6S .'° 65 . 20 ^530 6540 

TCCTTTTAATTTTTTGr.TGTTGAAAGCAACTCC ACTCTCTTTTTTTCCTC. Bajortty 
6560 6S7Q 6580 6590 ~" 6^0 

- » " j - > asu : n tgn sin tax ; s if 1 , i ifWaiii sssn. 



TTCATCCAGGCACGCCCTCATCTTTATTTTCTTTTGTTTCCGGAGTATCC Majority
6610 6620 6630 6640 6650



0474 _tBB[ a T CC A 
6520 Q T G A ■SB A 



wife IrWkr C A Ja CB "l& A ^£JJ^ C T| " T j£LL T T CBCC A C T A|| cfr 2603 alE.seq 

HCH A CBC CHBIC AD cT^ADBfT C XT tBTcBt T C^SHa C T A™C G „e«376_a I ^e q 

TCTTTCTTAATTAACGCTGGTCTTAATTTGTTACCTTCTTTTTCCTTAAT Majority

6660 6670 6680 6690 6700

K67 ^ CTT -lPTTA ^TTAAGGCTGGTGTTAATTT C TTACCTT CT TT T i a r.rr Ta 

■ U 1 A ^"^^^ M ™^^^^™MMBMMfBBB[cIlT TTTClHT A AT nen316.jal2.seq 

GTATTGCATTTTACCAGTTTTATTTTTTTTCAAAGCTAAAGCAAAGAACG Majority
6710 6720 6730 6740 6750

CACCTTTGATTTCTTTAGCTTCCTTTCACCCAAACTAACCTTTAAGGTCA Majority
6760 6770 6780 6790 6800

6623 C A C cfiT^ A T T T C T T t A r^i? r t ^ KM ^ C C * A A A AHci liM A A G , G T C- A* 2603_al2.se q 

6628 C A C CUT THATTTCTTTA G CflT CgBttCAC C C A A aQt A AGBttTa AGGTCA nemSie.aiE;^ 



TTAATTTGTTTACCTTTGTAGTCTTTTTCGTTCTTACCTTTTGTTCCTTG Majority
6810 6820 6830 6840 6850

6^78 Bll^jflc rWmmr^ I M^S fr T| M H A C C-raTBG TT G C T^ 2603_ai2.se, 
6678 TTAATT. TGTTTAC CBnDDT aQ T cBHtBt cQt TCf TACCJTTTG tHcCT^B he_3I6_ai2.seq 

CCAGTTACTTTTGTTAAGTTTTCCTTGTGTTTTGACAATCTTCTCCAAGG Majority
6860 6870 6880 6890 6900

6728. iWT AfflnTTHTHA aQtDDBg C T T G T gBBBTG A CAATCTTGTGCA AGG ne*3T6_al2 seq 

TCACTGTATT AGTTGTTCC TTCCTCCCCAAACCCTCCTGCAACTCAGAr. T Majority 
69 . 10 6920 6930 6940 6950 

6775 T C A CflG T A tBaGT T G T TBcBt CGTCCGCAA ACGCTGCTGC AACTGAG Alf? ne^ai^eq 

AGTGACGTTAAGGTCAGTAGCAGTGTCGAGAACATTGTAAGATATTTGTT Majority

6960 6970 6980 6990 7000

S^ 1 AiflT G ACGHA A&QT C A^G T Afl C Ad T GH CG AGAACATT gH A A H A T A T T T Q T T ?rt» ^ 
6825 A G T G A C'GT-T A A^T C A« T A G;C A™ t:gT C G A G.A AC £ t A ^J<jfS T lefCi^ 

C A T T T t T T T CAT XT C T-A T C.T CC T T C T T AT TTTK T T A A T C A A CA T G G T T A. ferity 

. 70 /° - , 70^0 70^3 7040 r 70^) 

6871 CA T T P T TTTCATTTCTA.TCTCCTTCTTA T T t TAGTT AATC AACAT ftWt t I 7M<* .19 ™ 
6875 CATTfTTTTCATTTCTATCTCCTfCTTATTB^U^AA^ 

ATA AT t ATGCGGATTTTAATATTA CCGCA G €> A C C A C T C C T T T C A A G TC A T G Majority 
7 °6° ,. 7070 70 80 7090 7100 

-2£ -a i 3* l&fi ? A T T A T A A T *HB' A;C CGCA.GCACCACTC c||t T C XA^GTcTTg 2603^i2,seq - 

6925 A T A AT A Tflc G G A T Tflr A A T A T T A C G G C A..G CA.CCACTC c"t T < JBBBBBBEI ne^^aiOeq 

CAATTTTATTTAATTAATTAAGAATACTAAAGCGCATGATTTTTAATCTT Majority

7110 7120 7130 7140 7150

6970 G A A T T T T aQ T T A A T T"A ATTAAGAAT AC TAAAGCGCA tRI ATTTTTAATTTT 9an ^ ™ 
6967 GAATTBtATTTAATTAATTAAGAATACTAAAHcBcATGA^tB^ 



FIGURE 20J 



PATATCTTTTCCAAATATAA ATT Majority 
m _ ^™ V^O 7180 7190 7200 

ilJia^Jw?*?*™!? ACTAGATTTCTT ATAflCTTTTCC AAATATAAATT 2603 al2.seq 
T T THT C TOG ATATATCACTAGATTTCTTATATCTTTTCCAAATATAAATT nemaTe.ai^eq 

- CCACCTGCA }nGACATCATAGCTCC A CCTATTAAAATGAAAGATAGAAT Majority 
72 , 1Q 722Q 7230 7240 7250 

70S TcTcTtT^ 

7067 CCACCTGCAATAGACATCATA g|!t CCACCTATTAAAATGAAAGATAGAAT nem31G_al2 s eq 

TCCTTTCCCACCTGTCATCGGAATAATTCCTTTTGGTGGAATATGCGTGT Majority
7270 7280 7290 7300

Z™ lrrlll^^^^^ TGrCA ™ * CAATAATTC CTTTTGGTGGAATAtGCGTGT. 2603 ai2.seq 
7117 TCCTTTCCCACCTCTCATCGGAATAATTCCTTTTCGTGGAATATGCGTGT ne^T^al^eq 

TGGTAATTAAATGCTTGTCACCTTCCTCATCATATTCAGAAATCTCTTTA Majority
7310 7320 7330 7340 7350

7167 T G G T A A T X A A A T G C .T T G T C A C C T T C C T C A T G A T A T T C A G A A A T C T G T T T A n^Te.a^eq 

TTAACAGCTATTATATTTTTTATCGATCCTTTAACCACTTCAAAAGTTAA Majority

7360 7370 7380 7390 7400

JIM!: AGCrT A TTATATTTTTTATC G ATC CTTT A ACC ACTTC A A A AGTT A A 2603 a!2 seq 
XTAACAGC. XAXXAXAXT T T0 T ATCGATCCTTTA A cQ A C TTCAAAAGT T A A n^T^aiiTL, 

AATTGGTTTATTACTAATTTTTTGATAATCCTTCGGCGAAACTGCTTCTA Majority
7410 7420 7430 7440 7450



7220 
7217 



7270 



» i ttp r tt! * I TAGTAATTTTTt'GATAATC C TH CGGCGAAACTGCTTCTA 2603 ai2.seq 
7267 AATTGGTTTATTAGTAATTTTTTGATAATCCTTCGGCGAAACTGCTTCTA nem3T 6 _al2. 



seq 



7320 



TTAACTGATATTTGCCATCTTTCAAATCTTTGTAAGAAATTTTGCCGTTT Majority

7460 7470 7480 7490 7500

7317 TTA ACTCAT A*T XXGCCAXCXXXCAAAXCXXXGXAAGAAAXTTTGCCGTTT nem316_ai2.seq

TCTCCCGTCACTACTTTTCAATTATTATTTTTTATTGGTAAATAAAGTTT Majority
7510 7520 7530 7540 7550



Z?J° I"^L C IS ACT ^ CTtTTCAATTAT TATTTTTTATTCCTAAATAAACTTT 2603 ai2.seq 
7367 TCTCCCGTCACTACTTTTGAATTATTATTTTTTATTG G T AAATAAAGTTT ne^ail^eq 

ATAATCTTCATTAAATTCTTGAACTTCAAACCTAGCTCCTTTGAGAACCA Majority
7560 7570 7580 7590 7600

It™ ^^^ A '^^^^^*^^AAATTCT*TGAAGTTCAAACGTAGCTCCTTTGAG"AAG'CA 2603_al2.seq 
7417 ATAATCTTCATTAAATTCTTCAAGTTCAAACGTAGCTCCTTTGAGAAGCA nem316_ai2.seq

ACTTATTATTATCTTTATCAACTTTTGTAAATTCAATTTCACCTAACTTC Majority

7610 7620 7630 7640 7650

t4?0 ACT TATf ATTATCT TTA X C A AC.'tT T TH TAA.AT. TCAATTTCA C C T AA C T t X> 2603 a!2 se^i 
-7467 ACT T A T TAT TATCTT TAT C A AC XT X X C t A A A X t C A A XT T:„C ; A C CTAACT TC ;,@^Seq. 

T X C X C C X XT TTAA.TCGXT A T X G X A G G A T ATT 6jx£ X C A C ATC A CG AATT TT Matoritv 



76 J 70 76 . 8Q *\ .7690- ;. 7700 

tSU tZZZ £'? Tt t X T A A TC G T -T AT T G T A G G A T A T T C T C T C A C A T C A C G A A T TTT 2603^12-seq 
7517 TTCTCGTTTTXAAXCGXXAXXCXAGGA X A TTCTCTCAC ATCACGA A T'T T T ne^lt.ai^ 

A G G G A T T G G A A A A T C T C T A A G T C T A T T A G G A T C C T C T G A T T T A G G A T T C A Majority 

7710 7720 7730 7740 7750

7570 A G G G A T T G € A A A A X C T C T A. A G T G T A T T A G G A T..C C X C X G A^T T A G GAT T C A 2603 ak^eq 
7567 AGGGA.TTGGAAAATCTCTAAGTGTATTAGGATCCTCTGATTTAGG ATTCA ne*316_aiTseq 

ATGTTGTTCTACCATTACTCTCATACAATT TC TTACTT-ATAAAACTCXCA, Majority 
7760 7770 7780 7790 7800

7620 AXGTTGTTCTACCATTAGTGTCATAGAATTTGTTACTTATAAAACTGTCA 2603_*12.seq 
7617 ATGTTGTTCTACCATTACTGTCATAGAATTTGTTACTTATAAAACTGTCA nem316_.al2.seq 



FIGURE 20K 




TCU P g^P jySSCl g Eg SllElS C T T T T T GTCC TTCTCC TAAGTT MaioTTT^ ' 

7810 7820 7830 7840 7850

3^^ TTTC ^ CATCATATG ^ GAGTGTTAC ^ TTTT g B C CTTCTCCTA AB T T 2603 a!2 sea 
TCTAGTTTC AC ATC AT ATCTC AC TC TT A CTTTTT Gf C* *C C T A AGTT nSiS^il^eq 

CAAACCTCTAA CCTAC AGTTTATTTTTC AT GTATTCTAATTTAACCCCTT Majority 
7860 7870 7880 7890 7900

^Ty^TyFlTt S 1 ? 1 " t ° A ° T * T A T 1 tBB,G ^CTATTCTAa'tTTAA C C C CRT 2603 ai2.seq 
CAAACCTCTAACGTAGAGTTTATTTTTGATGTATTCTAATTTAACCC C T T ne»aTe^aI2^eq 

TAACTATTCCACC ATCATTA. TTAGGCCCACCAGTTC CA ATGCTATCTTTC Majority 
7910 7920 7930 7940 7950






iM^^I TC ^ ACCATCATTATTAGGCeCACCACTTCCAA tBgDa T cBt T C 2603 aI2.seq 
T A A G T A T T C C A C C A T C A TT A T T A C G C C C A C C A C T T G C A A T G C T A T c" T T C nen^ai^eq 

ATTATACTTCCATCATTTCC .CTGTAAAGTA.TAAT. CACTTGGTTGTAATGT Majority 
7960 7970 7980 7990 8000

A T T IB A CTTCCATCATTTCCCTGTAAAGTATAATCACTTG cQ TGTAATGT 2603 ai2 sea 
ATTATACTTCCATCATTTCCCTGTAAAGTATA AT C A C T T G GT. T cQ A A T G T ne^T6L.a!2^eq 

TTCTCCGTTGCCAACCTCTA AATTGATTTTGTCACCCATAGGATCTTCTA Majority 
8010 8020 8030 8040 8050

II HS?? TT, " CC * * GC ? GTA A ATTC ATTTTBTC ACCCATAGG ATCTTCBIA 2603 ai2.seq 
TT.GTCCGTTGCCAAGCTGTAAATTGATTTTGT.CACCCATAGGATCTTC?A nemsTe.ailtseq 

TAGTTCC ATTAACA AT T..G AGT TTTCTTTTGTTAAAATCTTTT CA A A T T G T Majority 
8060 8070 8080 8090 8100

TAGTTCC ATTAAC.A ATTGAGTTTTCTTTT. GTTA.AAAT cH TTTC A A AT TGT 2603 a!2 sea 
TAGTTCC A T T A A C A AT TGACTTTTCTTTTGTT AAA aQ CT'T TT C A AATTGT 

T G C T G A A T T T T A C A T A A A A T T T C A T T G T T A G A TGTATCG CCT GAAG TT AC Majority 
8110 8120 8130 8140 8150



7970 T G C T G A ATT T T A G A T A A A A T T T C A T T 6 T T A G A T G T A T C G G C T G A A G T T A C 2603 al2 sea 
7967 TGCTGAATTTTAGATAAAATTTCATTGTTAGATgIIaTCGgBtGA AGTTAC I^TSiSea 

TATCGGGGTGTAGTACTCA GGTTTGGAAG-AG A AT GACTTCATTAGTTCTG Majority 

8160 8170 8180 8190 8200

DATllGGG G TGTAGTAC TCAGGTTTGG AAGAGA AQG AcBtC ATTAGTTCTG 2603 ai2 sea 
8017 TATCGGG G tH T aB TACTCAGGTT tQ-G AAGAGA AT G" A C T TC ATTAGTTCTG ne^T6^.ai2^eq 

TTAT TTCTCC ATC TG AAA GT TT A A A A6 GTTCC TCTTTC A ATTTTTG A A AA Majority 

8210 8220 8230 8240 8250

8070 TEA T TT CTC C A T C TG.A A A GT T T A A A A G C.t T C C T C T T T C A A T T T TT G A A A A 2603 ai2 sen 
8067 TTATTTCBcCATCTCAAAGTTBAAAAGCTTCCTCTTTCAATTTTr^ 

GTACCA T CTT GA. TT TT.TC T TATtACTC CTC ATTATAA ACTTGTCTAAAAGC Majority 

8260 8270 8280 8290 8300

?j?2 G I A A l C - T T G A T T T J T C f T At ACT C C T C A T T A T A A A CT-T GT C T A A A AB C 26p3^al2.seq 
;8U7 G T A C CAT C T T .G A T T TT T C T X A T'A-Qt C C T CAT "T A T A A A C f T G T C T A A A A™ C -^T^ll^eq 

• A GAT A T A T. C. T. A T A C C A * A A A T T A A AG A T G T € A T A A T t T T f CTC'TTT T A A A C Majority 

8310 8320 8330 8340 8350

■5* a ^ ^ I A I A j A rA CCAAAATT AA AQ A T GT C A T A AT TTTTCTGTT TT A A A "C 2603 ai2.seq ' 
8167 A G A T A T A T C T A T A C C A A A AT T A A A G A T G T C A T A A T T T TT C TCTTTTAAAC neasle^^eq 

TATTTAT ATA AA CTTTCCTTCGTGTTCC ATC T TC TT. TTACTGGTCCATTT Matoritv 

8360 8370 8380 8390 8400

^ 20 - TATTTATATAAAGTTTG CTT-C.G TGTTC CATGTTCTT tQa CTCGTCC AT T T 2603 aialseq 
«217 T AT TTAT AT A A A G TT T GGTT G.G KTT CC AT G T TCT|| TtA CT G G t C C A TTT ne^Te'ai^eq 

CGATAAA'TTGTAC.CTTTAGG. GTAATTAAGA TT TA A ATCT'A A AT A ATC A A C Maforltv 

8410 8420 8430 8440 8450

8270 CCATAAATTGTAGCT.TTAGGGTAATTAAGATTTAAATCTAAATAATGAAG 2603 ai2.seq 
8267 CGATAAATTGTACCTTTAGGGTAATTAAGATTTAAATCTAAATAATGAAG nem3T6_al 2 !seq 



FIGURE 20L 




T T T T T>Tc''fU A8c-f gg! gWgg g A jjgrjjA gg T' j G fG TTTCATAACTATCTAAGGGAA Majori^ ^ 
8460 8470 8480 8490 8500 

TTTTTGTAAGTTTCCAGAGATTATCTGTGTTTGATAACTATCTAAGGGAA 2603_al2.seq 
TTTTTGTAAGTTTCCAGAGATTATCTGTGTTTGATAACTATCTAAGGGAA nem316_al2.seq

ACAAAAAGTAACTCTCCCCATTTCCTTTTATATCCTCGGGCTTATCAGTA Majority 

8510 8520 8530 8540 8550

ACAAAAAGTAACTCTCCCCATTTCCTTTTATATCCTCGGGCTTATCAGTA 2603_al2.seq
ACAAAAAGTAACTCTCCCGATTTCCTTTTATATCCTCGGGCTTATCAGTA nem316_al2.seq

AGTACAAAATTA CTTTTATTTAGATATCCATTTTTTTTCATTTGTTCAAA Majority 
8560 8570 8580 8590 8600 

AGTAGAAAATTACTTTTATTTACATATCCATTTTTTTTCATTTGTTCAAA 2603_al2.seq
AGTAGAAAATTACTTTTATTTAGATATCCATTTTTTTTCATTTGTTCAAA nem316_al2.seq

TTGGCT TTCATATGATGCACCCAGTTTAAAATTATTAATAC C A T A T G A T C Majority 
8610 8620 8630 8640 8650

TTGGCTTTCATATGATCCACCC ACT TT AAAATTATTA ATACC ATATCATC 2603_ai2.seq 
TTGGCTTTCATATG. ATG. C AC CC A|!t TTA AAATTATTA ATAGCATATC ATC ne-i3i6_ai2.seq 

TTCTTG CAACACCATCACTTATATCAACAATAATfTTTTCACTATTTCCA Majority 
8660 8670 8680 8690 8700

T|lC THC G A AC AC CATC AC TT A||AT G A A C A A. T A A T T T T T T G A C T A T T T C G A 2603_ai2.seq 
TTGTTGGAACACCATCAGTTATATGAACAATAA.TTTTTTGACTATTTCGA ne«316_ai2,seq 

TTTACTTGACT C AA AATATC ATCTGCCTCC AT GAAGGC. TTTCATAC T A A A Majority 
8710 8720 8730 8740 8750

TT TACTTGACTCAAAATATCAT.CTCCCTCCATGAACCCTTTCATACTAAA 2603_ai2.seq 
THT ACTTCACTCAAA AT ATCATCTGCCTCCATGA AG G' C TTTCATAG-TAAA nem316_ai2.seq 

T G T T T C T C C T A C T T T A C T A A G A T A G T A C T C C T T T T G T T G. C T C T G C A C T T A Majority 
8760 8770 8780 8790 8800

8620 T G T TTCTC C T A C T T T A C T A A G A TAG T A.C T|J|C T T T T G TT GCTCTGG'AGTTA 2603_al2.seq 
8617 T G T T T CflC CTACTTTACTAA GAT AGTAC.TGCTTTTGTTGCTCTGGAG tQa nem316_al2.seq 

GTCCGT TTGTA G.TT G ATC CCCATTTAGCTTTAGGAGCTTCTGTCGGAATC Majority 

8810 8820 8830 8840 8850

8070 BTC CHT TBJGT A CtflC ATCCCC AQ T TAGCTTTAGCAGCTTCTGTCCGAAT C 2603_ai2.seq 
8667 GTCCGTTTGTAGTTGATCCCCATTTAG C.tHt AGGAGCTTCTG tQg G A A T C xiem316_.ai2.seq 

CTTTTT ATAATCTCTTCAGCATTATTTGT.T A' A T T G TT TATCACTATAATT Majority 

8860 8870 8880 8890 8900

CTTTTT ATA ATC T C TTC.AGC ATTATT.TGTTAATTGTTTATCACTATAATT 2603„al2.seq 
CTTTTTATAATCTCTTCAGCATTATTTGTTAATTGTTTATGACTATAA.TT nem316_ai2.seq 






CTCTCTCTCAATTGTGAACTTAGTTTGAAGGCCATAATATTTATCATCTT Majority 
8910 8920 8930 8940 8950

8770 CTCTGTCTGA A'TTG T G A ACTTAG. TTTGAAG G.CC A.TA ATATTT ATCATC T T 2603jai2.seq 
8767 CTCTGTCT G A A-TTG-t G A A C T t A G T T TjG A A G..G CC A TA A T A T t.T A T C A T-G.T T ne&316_jai2.seq 

XTTTAAATC C TT TT ACG A C ATC T A C A C T CCTAC C Jl T C A A X A A T AT C TG A A Majority 
8960 8970 8980 8990 9000

C TT T A A A T C G T TT T A CG A C A T C TA C A G T CC.TACC A T.C A. A A AATATC T G A A "2603_^al2-.steq 
CTTTAAATCCTTTTACGACATCTACACTCCTACCATCAAAAAT A.T CTGAA nem316_.ai2.seq 

C€,ATA CCTA ACTA ATCC AACCCTATTATC ACTCTTTGCTCCTAA A A T A-.T C Majority 

9010 9020 9030 9040 9050

C C ATAGGTAACT AATGCAAC C C TATTATCACTGJTTGCTCCTAAAATATC 2603^.ai2.seq 
CCAT.AGGTA ACTA AT'GCAAC C.CTATTATC ACTGTTTGCTCCTAAAATATC nera3l6_aI2;seq 

TTTTAC TGCGGTCCCAAGAGCTTCGGCAGCTT. TCTT'GGCTTTATTATCCC Majority 
9060 9070 9080 9090 9100 

TTTTACTGCGGTCCCAAGAGCTTCGGCAGCTTTCTTG.GCTTTATTATGCC 2603_ai2.seq 
TTTT.ACTGCGGTCCCAAGACCTTCGGCAGCTTTCTTGGCTTTATTATGC.C nem316_al2.seq 



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T T T G A»A" A'-A T»T 



TGAGTTAGAATTATCGAGTACC Majority 
9110 9120 9130 9140 9150

TTTr i i * 1 J- TGGGC CATCGTTATTC ATTGAGTTAGAATT. ATCGAGTACG 2603 aI2 seq 
8967 TTTGAAAATTTGGGCCATCGTTATTCATTGAGTTAGAATTATCGAGTACG ne*a76_aiH 

AAGACAACATCTAACGGCTTTTGTTTGTCCACTGCTTTTACTATGGTTTT Majority
9160 9170 9180 9190 9200

AAGACAACATCTAACGGCTTTTCTTTGTCCACTGGTTTTACTATGGTTTT 2603_al2.seq
AAGACAACATCTAACGGCTTTTGTTTGTCCACTGGTTTTACTATGGTTTT nem316_al2.seq

TCCACTGACAGTTAACTCAATTTTATATTTATTATGACCTAAATCACCTA Majority
9210 9220 9230 9240 9250



22? ?CCAC?r ArJ "I^V^I^ AATTTTATATT ^ A TTATGAGCWA ATCACCT A 26^2^ 
9067 TCCACTGACAGTTAACTCAATTTTATATTTATTATGAGCTAAATCACCTA ne^IG^ailTs 



CTTCTGAAATACGTTTAGATAATGTTCCCTCTCGAATTTCTCTTATATGC Majority

9260 9270 9280 9290 9300

SJ??JStwItSSli:.J5*I*f TCTTCC . C TCtCCAATTTCTCTTATATCC 2603_ai2^ 



9117 CTTGTGAAATACGTTTACATAATGTTCCCT.CT.GCAATTTCTGTTiuKcSSueq 

TCACCTTCACTTGAATATGG G TTAACTGCTTTTGCCTCTGACTTTCCATT Majority 
9310 9320 9330 9340 9350

9170 TCACCTTCACTTCAATATGGGTTAACTGCTTTTGCCTCTGACTTTCCATT 2603 al2 sea 
9167 TCACCITCACTTCiATATG.GGTTAACTGCTTTTGCCTCTGACTncCAU neSl^a^ 

TGGAACTGA A C-CTTTAACATGCTCA A GTT T A T AACATTGCTTTCTATCTT Majority . 

9360 9370 9380 9390 9400



9220 TGGAACTGAACCTTTAACATGCTCAAGTTTATAAGATTCCTTTGTATCTT 2603_al2.seq
9217 TGGAACTGAACCTTTAACATGCTCAAGTTTATAAGATTCCTTTGTATCTT nem316_al2.seq



C A T A A A T T C C T G T.G GGGGGA TACTGCTTATCTAGTTCTT CGTGATTTTGT Majority 
9410 9420 9430 9440 9450

??70 C AT A A ATTCCTGTGGGPG.GATACTGCTTATCTA'GTTCTTcBtGATTTTGT 2603 ai2 sea 
9267 CATAAATTCCTGTGGGGGGATACTGC.TTATCTAGTTCTTCGTGATTT^ 

CCAATTGTCGAATTTTTATCUCAC T ATTTTGTATCGTAGTTTTTCCATT Majority 
9460 9470 9480 9490 9500

9320 C C A..A TTGTCCAATTTTTATCACCACTATTTTGTATCGTACTTTTTCCATT 2603 ai2 sea 
.9317 C C AA T-T G T G-G A ATTT'TTAT C.A CCACTATTTTGTATCGTAGTTTTT C'C ATT neSle^eq 

ACTCTCAACCTTAACTTGC CAAGTCTGGTTAGTCTTTTTATAACCTTCCG Majority 
9510 9520 9530 9540 9550

22 A r^Ho AA ^ CTTAACTTG 00 AACTCT GGTTAGTCTTTTTATAACCTTCGG 2603 ai2.seq 
9367 ACTCTCAACCTTAACTTGCCAAGTCTGGTTAGTCTTTTTATAACCTTCGG ne^Ie^i^eq 

GCGCTGTTTCTTCTGATAAAGT ATAATCTCCACGTATGAGATTATCAAAA Majority 
9560 9570 9580 9590 9600

'fZrZlZZZli It** C A t A A A « T A T A A T C T C C A G G T A.T G AG AT TATCAAaI 2603^ai2,sea. 
,9417 G C G € T G T T T C T T C T G A.T A A- A*,G TATAATCT C CyA GG T A T G A G A T t A TC A A A A ^l«WSeq 

, C T A G C T T C A; C C TGT TAG C T C A CCA.G.TTA C-t T TT T CTAT T'T'T A .C T1TG T G.G ' Majority 

9610 9620 9630 9640 9650

M70 GTAG C T TC A C C T G T-T A GCTCAGC AGT.TACTT TTT C T A T T T T A C T T T C.T C G 2603 ai2 seq 
9467 GTAGCTTC A C C T G T.T A G C T C AGC A. G T T A C T T T T T C T A T T T T A C T T T C T G G ne^T^ail^eq 

ATGAGCAGTA G T TT T T A A A A C!A A A GCT AG CTTTTC A A AG T G C I TTCTTC T Majority 

9660 9670 9680 9690 9700



A I ? A « J * * I A * J £ J J J A A A A C A A A G G T A GCTTTTG A- A A G TGGTT TG T T C T 2603_aii:seq 
9517 A T G A C C A G T A G T TT T T A A A A C A A A G G T A G C T TTTGA A AG T GG TTTGTTCT nem316_ai2Tseq 

. GCTCATCTGTCTTTTTAAC A ACTAACTTTCCTTTAGCACCATTTTCCGGT Matoritv 
9710 9720 9730 9740 9750

9570 YTzTT^JTJ^ 

GGTCATCTGTCTTTTTAACAACTAACTTTCCTTTAGCACCATTTTCCGGT ne^Ie.a i2^eq 
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" a c c c IpM BEB jpSI g SE E aTFF c 5 j ; f f T c c c a c a a A irr ^^— 

9760 9770 9780 9790 9800

A CCCT ACTTTCCCCTAAAACATTGGTATTAAGCCCTAT T T C C C A r & a apI o^o ,o 
ACCGTACTTTCCCCTAAAACATTCCTATTAAGCGGTATTTGCGACAAACA neffl3T6^.a!2?seq 
AAAAAGACTTAACGTCAA T A TTTTAGAAAA T T TTTGOTATTTTCTCATTT HajorUy 

9810 9820 9830 9840 9850

AAAAAGACTTAACGTCAATATTTTAGAAAATTTTTGCTA tTTTCTrATTT 7r m „ 
AAAAAGACTTAACGTCAATATTTTACAAAATTTTTGGTATTTfTCTCATTT neffl3l6.1al2lseq 

TACAACTCCTATTGTCCCCAAATGTCGTTTCTA A ATCTAACATCACATAr Majority 
9860 9870 9880 9890 9900

AGAATATCCTAGA ATATAC AAACTATCACTTATTATCATATCAATAATTT ICaJority 
9910 9920 9930 9940 9950




9770 AGAATATC CT AGAATAT AC A A A CT ATC ACTTATT ATC ATA tC A AT A A ttt *«v* 10 
9767 ACAATATCCTACAATATACAAACTATCACTTAT?"^ 

CTTATTATAAGCTATGGAATTTTAATCTTTTTTCCCAATTTTTGAATGAT Majority
9960 9970 9980 9990 10000

9817 CTTATTATAAGGTATGGAATTTTAaBgTTTTTTCCCAATTTTTGAATGAT ne«3T6_aiCeq 
TTTTCTTTTTATTTG ATAATCT TATTTTTTATTATCTTAGAAATATTTr; u»^, 
10010 10020 10030 10040 10050

9870 TTTTCTTTTTATTTGATAATCTTATTTTTT ATTATCTTAGAAATA TT x r 1 ' reni ,o 
9867 TTTTCTTTTTATTTGATAATCTTATTTTTTa"^ 

ATTAGCTT.AAGTAGTTGATTTTTCTT T T TT T ATGTTTTAAAATATTGCTT Majority 

10060 10070 10080 10090 10100

9920 A Tffl AGCTT AAGTAGTTCATTTTTCTTTTTTT A : T GTTTTAAAA TAT T C C T T i?m<* *<> 

a ttacctta> ^^ 

AAAAATAATCTTTGAGAGAGAGTTTACTG AATTGATTGAAAATT.ATTTAG Majority 

10110 10120 10130 10140 10150

" 70 "AAAAATAATGTTTG'AG AG aJ B I G TTTACTCAATTGATTGA AAATTATTTAr ?fi n. .0 
9967 AAAAATAATGTTTCAGAGACAGTTTACTGAAT?GA??CAAA^ 

AAAAAGACATCCTTAATCAAATAAAACTT C TAAC, TTTATGC.TA T G ATTAC Majority 
10160 10170 10180 10190 10200

J°5? 8 AAAAAGACATCCTTAATCAAATAAAACtTCTAACTTTA TGCTATGATTAC 26f» ^9 ^ 
10017 AAA A A G A C ATCCTTAATCAAATAAAACTTC T A ACTTTATGCTATGAT T" A C SSiSe, 

TACCCTTCCATTACTCTAGACAAATCAT Gl C A T CAACTKG TT TATCTGA Majority 
10210 10220 10230 10240 10250

?°?68 T A C C C T T C C A TT A C T C T A G A C A A A T C A T G T'C A T C A A C t T G G T T T AT C T C A ?<™ ^ ~~ 
1O067 T A CCCTt C C ATT A Ct.CT A G AC A A AT C A T G T-C A T £ A A £ TT GCT.f ?}?££S ii28ffi2* 

- ACYA C T T AT- T. A PC A A AT A Tj r j T C-A T G A TTTAACAACTT T'A TUAiCA T C Majority 

10260 10270 10280 10290 10300

10118 A G T A.C.TT A T T A G G A A A T A t T G T C ATGATTTA AC A A CT-TT AT T T A "A- PA r T r 9 A» * 
I0H7 ACTAfCTTA'TTACCAAATATTGTCX TC A TTT A A C A A C T T T A T T T A AC A G T .C SSiSeq 



AACTCTCTCTGAATATCGAAAAGAGTACAATAGTCTACCAATCTAATGGT Majority

10310 10320 10330 10340 10350

10168. A A C T C T C J C T G A A T A T C G A A A A G A G T A C A A T A G T C T A C C A A T C T A A T C r T fRtn =i» ™ 
10,67 AACTCTCTC. TCAATATCGAAAAGAGTA.CAATAGTCTACCAATCTAATGdT' neS76^.a!2?seq 

GTAACTAGAGAACAAGCTTTCAAATATAT TTATCATnAATCACACGTTTT' Malo^l^ . 

10360 10370 10380 10390 10400

10217 GTAACTAGAGAACAAGCTTTCAAATATATTTATCATCAATCACACGTTTT nS'ajHeq 



FIGURE 20O 




a c a a c^jji y A" 4j a isjiS G/ ypq WMm ^t F cagccagg ttacctttaa : 

10410 10420 10430 10440 10450

10267 ACAACTTTTAAAATTTTTCATCACGAATCATTCAGGCAGGTTACCTTTAA nem376_al2^eq 

CTTACTTTAGT GAAAA ATTTGGACTATCTTCTGCA ACTGCTTATCC C A T A Majority 
10460 10470 10480 10490 10500



10318 CTTACTTTAGT C A A A A'ATTTG GACT ATCTTGT CC A A C TGCTT ATCCC AT A 2603 <=~, 
103,7 CTTACTTTAGTGAAAAATTTGCACTATCTTGTCCAACTGCTTATCGCATA ^iSe, 

CGAAAACATATTAGTCCGTTACTAGAA AAACTTCGATTTCAGATTTTCAA Majority 

10510 10520 10530 10540 10550

10368 CGAA AACATATTACTC CQ T T A-CTAC A A A A A CT TG G AT t T C AG AT tIH T C A A 2fi03 <^ 
10367 CGAAAACATA.TTAGTCCGTTACTAGAAAAACTTGGAT.T tQ AG AT ^ T T C A A ^T^iSeo 

' AAATAC TATTACCGGTGAC GAGTATCGAATTCGCTATTTAATCCCATTTT Majority 
10560 10570 10580 10590 10600

10418 AAA TACTATTACCGGTGACGAGTATCGAATTCGCTATTTAATCGCATTTT 2603 a l2 ^ 
10417 AAATACTATTACCGGTGACGAGTATCGAATTCGCTATTTAATCGC. ^BSD T T SlalHeo 

TAAATGCTCGAT T T GCTATACAAGTTTAT C CCTTGTCTAACAfGGATAAA Majority 



10610 10620 10630 10640 10650



10468 _T A A A T G C T C^ A T T T G G T A T A G A A G T T T A T C C cffT GTCTAAGATGGAT f A 2603 - 
10467 T A AATGCTCGATTTGGTATAGA'AGTTT AT C C CT tQ TCT A A G'ATG G ATA A A 

. TTGCTTAT C A A ACCATTGTTATTA GAATACTCAACTACTTTTAC T G C T T C Majority 

10660 10670 10680 10690 10700

J^}5 "5£I» A ™^ 2603_ai2.seq 
10517 TTGCTTAT C AAA C GAT T G TTATTAGA.ATACTCAACTACTTTTACTGCTTC nea316_ai2^eq 

T C A T T A C T T C C C A A A T A C A T T T A T T TT CTTTGAT AC ATTCTTGTCT-CT AT Majority 
10710 10720 10730 10740 10750



10568 T C A T T A CT TCCCAAATAC A.TTTATTTTCTTTG ATACATTGTTGTCTT T A T oeun al % c<MI 
10567 TCATTACTTCCCAAATACATT.TATTTTCT XT G AT AC A.T ^0 T TG TCTCTAT SSSSSe, 

CAT. CGA AACGTATT AATTATAATGTAGTTGTCCCTTACTCATCCCTTTTC Majority 

10760 10770 10780 10790 10800

10618 CATGGAAACG TA TTAATTATA AtGT AGTTGTCCCTTAC TC ATCCCTTTTC 2603 ai2 sea 
10617 CATGGAAACGT'ATTAATTATAATGTAGTTGTC C'C T T A C T C AQ CCCT.TTTC SSeq 

ATTGAAC TACAAAATATATTTATCT'ATGATACATTACAATATTGTGTCAA Majority 
10810 10820 10830 10840 10850

10668 AHTGAACTACAAAATATATTTATCTATGATACATTACAATATTGTGTCAA 2603 ai2 sen 
10667 A TTGAACTACAAAATATATTTATCTATCATACATTACAATA^T^TGT^AA ne^T6^ai : 2^eq 

AAATGTTATTATAGATT CCT TTAAAATTA.ATTTAAAAAAAGACCATATAG Majority 
10860 10870 10880 10890 10900

J 0 ™ AA aJ^tJ A V^ ATAG ^ TTC C ^ T T A . a A ATT'AATTTA A A AA A A.-G ACGAf ATAG 2603_ai2.seq 
:10717 A A AT GT.TATTATAGATTCCTTTAA A;A * T A.AT T TJV A A A AAA G.A C G AT A TA G ■ henj316_ai2^eq 

A C T A T AT TT T T C T T.CC TT A CC T T A C tTCCU T A A 6 TCTtUTCT A A TGC A Majority 

10910 10920 10930 10940 10950



10767 ACTATATTTTTCT T'G CTT ACCTT A.C-T TCCC ATAACTC TT TTT C.T A AT CC A. ne^T6^i2^^ 



I A ? I ^ ? 5 ? A I A A . C T C TT T T T C T A.A, TCCA 2603_ai2. 

2:seq 

A A T T G C A C T G A C A A G C C T A T C G A T A A T G.T A A T A G C T A T T T T C G A A A A T T A Majority 

10960 10970 10980 10990 11000

Inlt AA II^ ACTCAG A A G CGTATCGATAATGTAATAGCTA T T T T C G A A A ATT A 2603 al2^eq 
10817 A A T T G G A C.T GAGAAGCGTATCGATAATGTAATAGCTA T-.T TTCGA AA'ATTA iH^Te^alltL, - 

TCCCAAATTCCAAAAA TTATTACAGCCACTCAAAGATGCTCTTCCCTTAT Majority 

11010 11020 11030 11040 11050

J0868 TCCCAAAT.TCCAAAAATTATTACAGCCACTCAAAGATGCTCTTCCCTTAT 2603 aI2 sea 
10867 T C C C A A A T T C C A A A A A T T A T T A C A G C C A C T C A A A G A T G CBC T T C C C T T A T nea3l6_ai2.seq 



FIGURE 20P 




11060 11070 11080 11090 11100

* C * T TTATTTTACGATT A .AATCAATT*ATCCCTE« A ACAATTT«ATTTcr Hajor.tv 

11110 11120 11130 11140 11150

TTCATCCAACTATCATAGACATCATAA A T T A A C T-« « T A T T C.T C C G A r. r r. A Kajorlty 
11160 11170 11180 11190 11200

TAATTACAAATTCCTTA A G T C A A A T T G C A C A A T A C A C C T T T A A r. r. A An A a ^ wl(y 

CATTTTCTTC-TCCTTTGTGr. TC ATC TA GA AA-CAATTATrAAAAATCATAT HaJorlty 
11260 11270 11280 11290 11300

TCCTCCGATACAGATAGCCGTACTAACTACAGACTTTA T TAATAACnA A-A u..^. 
11310 11320 11330 11340 11350

TTTTAACACAATGTTTATTACACACGT.TTTCTTnT AAACAr.ATTn A TjTC Majority 
11360 11370 11380 11390 11400

CACCCTT'ACTATTTATTA. ACTCA-TGATC.T T-T CC A* A T A TTACTAA TCtt a- a ^ ortty 
11410 11420 11430 11440 11450

' TCCACATATTATTATTACCAATCCAA AGCTTTCTTC^ T TTATCAA AC AT ? u^-,^. 
11460 11470 11480 11490 11500

41317 T C € A G A^? A T^AITATIACC.A AhS^^ A A ICCTIIC tS^C CT H A TQ A A A C A T i n£^l€Lal2?seq 

AaATTTCTTCAGAGAGTTTAATTACATATATTGATTTG A TTCATArTTCA UaJorlty 
11510 11520 11530 11540 11550

GACCAfi'ATCA'ATOA AATCCA AG A -A AT t A T T T r. A t C A A T A C A- "fi-C A A-C A 'A A 'A ■ HaJprHy • 
11560 11570 11580 11590 11600

s ; e s ;^ ^ s i ; ^ ; ? ? I S£ ; i i ? ? ; ? ? ? g ; - ; - 1 « j^? ; - i - i ^sii, .-. ... 

ATATTCTAAACTTTTTG CAAAA ACT-AATGAAATAAr TACTCCTAGCTrcT u~ f „,. r 
11610 11620 11630 11640 11650

ATAACTCTTAAAAATTAACATTAAAAAGCTAGAGCATTCTGTAATGCTCT Majority
11660 11670 11680 11690 11700
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T^S^! ?^^cWO 2006/0783 1 8 * Weighted residue weight tabie, ' W PCT/US2005/02723? 3 

A G C TXp^g *j| A A T C r ,A c T T T T C A A C T ? ^.^^ " 

11710 11720 11730 11740 11750

USCT AGCT?TTT!!T^ T ^ ATTTTTTKAATAOATAATCCAA " TTT CAACTi 2603 a!2 sea 
11567 AGCTTTTTAATGTTAATTTTTTTCAATAATATAATCCAACTTTTCAACTC ne^TeJ i ^seq 

TTTTTTC CC ATGTG AAATCTTCTTTAATT C TTTT.ACCAATATT.C TGTTGT Majority 

11760 11770 11780 11790 11800



I16I7 TTTTTTCCCATGTGAAATGTTC 



11617 TTTTTTCCCATGTGAAATGTTCTTTAATTCTTTTAGCAATATTCTGTTGT ne^la^ 



II3 A *II5IIII A ?? AATATTCTGTtGT 2603_al2.: 

S.seq 

ACTTTCTCTCTTAATGCCTT.ATC T TTTACTAATAAATCAA.GAGATTCATr. Majority 
11810 11820 11830 11840 11850



CAGTCACTGAGTA TTTTCT TCCATGATGATTCCTAACTCAGGCCTATCAA Majority 

11860 11870 11880 11890 11900

11747 GAG TCACTCAGTATTTTCTTCCATGATCATTCCTAACT CAGGGCTATCAA 2fim ™ 
11717 CAGTGACTGACTATTTTCTTCCATCATGA^ 

TAACT-TCAACTG TTCCAC CGCGA T C TCTTCC AAT A A TA GCACTTGAAACT Majority 
11910 11920 11930 11940 11950



i\l^ltArllry.flnll^^^ C ^ GCGkTCTG TTGC AATAATAGCACTTGA'AAGT 2603_ai2.seq_ 
11767 TA ACTTCA ACTGTTCC ACCGCGATC.TCTTCCAATAATAGC ACTTC A AAGT ne^^a^eq 

AGACCAGCTTCTAAAATAGAGGTTGG T AATCCCTCTGGA TAG A TT G A A C G Majority 

11960 11970 11980 11990 12000

11817 A A-CCAGC TTCTA AAATAGAGCTTGGTAATCCCTCT CCATAC ATTC AAT C ?fifv* ^ 
.n817,A C ACC.AG,TTCTAAAATAGAGGTTCGT A ATCCCTCTGGA^cl 

GTAAACAAAGATATCAGTCTG.TGCCAT T AAAGACATAGTCTGTTCAAAGT Maiorirv 
12010 12020 12030 12040 12050

11S6T r T A iAr' AA 'i^AT- A J^ A ^ TC ^ T A A A G A CATAGTCTCtTCAAAGT 2603 ai2.seq 

.11867 CTAAACAAAGATATCAGTCTGTGCGATTA A AG A CATAGTCTCT T C A AAGT ne^TeJiHeq 

TTAATTTCCCCAAAAACTTAA T- C T G T T T G.G.A CTGATATTT.CTCTTTCAA A Majority 
12060 12070 12080 12090 12100

11917 T.TAATTTGCCCAAAAAGTTAATCTGTTTGGACTGATATTTCTCTTTCAAA nein316_ai2^eq 

TGTQCTAATTC AGGTCCG TCTCCTGCAATCTGTAAATAAACATTTTCAGA Majority 
12110 12120 12130 12140 12150

U967 TITII^^ 

U967 TGTGCTAATTCAGGTCCGTC T C CTGCAATCTGTAAA T.A A A C A T T*T T C A G A ne«3i6_ai2.seq 
. G ^ ACTGTG A.C AT'CG A AAAT GCT T. C T A A G A G G A.ATTC AATG.C C T T T, T T C .T T Majority 

12160 12170 12180 12190 12200

?5?* 7 G T A C T;G T G.A CAT CGAAAATGC TT C T A A GAG C A A T T C A A T G C C T T T T T C T T 2603 a!2 seo 
12017 G T A CT W.ACATC G A A A AT G;c T T- C T A AG A G C A A T T C . A A T G C C TT'f * TCtT SSiOeq 

• TAATA A-T'T "C T A.C C A G C A T A AG T G A t G A A A A TA T C A T.C A G C A G A t TT T'TC A Majority 
12210 12220 12230 12240 12250

: g°£ TAATA AT TG.t A C C AG CAT A- A G T G A T G A A AA T A T C ATC.ACC A*G A T T T T* T C A 2603 ai2 sea * 
12067 TAATAATTCTA.CC A*G CAT AAGTGATG A A A AT A t CA T C A G C A 6 AT T T T T C A n^^Te^al^Seq 

A G G T A-A GCCGTACC AG C AAA AT C A G A G C C TAG ACTTTCAGATAC CG A ATT Majority 

12260 12270 12280 12290 12300

. ? 2117 A G G T A A:GC C G T A C C AG C A A A A T C A G A G C C TAG ACTT.TG AGATACC g A A T t 2603 ai2 sea 
12117 AG G T A AG C CG TA CCA 'G C A A A A T C A G A CC C TAG ACTTT C A GA T A.C C G A A T T 

, ATAAATAACT CCTTTA GCTTCfATATTAAAATGTTTTAACCATTCAACCC Maloritv 
12310 12320 12330 12340 12350

2S A? AAA ? AA ^i^I TTAGCT ^ CTATATTA 2603_al2.seq 
12167 ATAAATAACTCCTTTAGCTTCTATATTAAAATGTTTTAACCATTCAACGC nem316_ai2^eq 



FIGURE 20R 




XL£3L ffi fej W' 1 1 Miff ft g ^ ACCATAATCCTTAiVCACGCCCT , UaJorUy 

12360 12370 12380 12390 12400



T £ cJc^GL^^%% G J^ T MM^^ GGACGATUTGCTiAACACGCGci 2603_ai2.seo 
12217 TTCTCTTGGATACCGCATAAAAATCTGGACGATAATGCTTAACACGCGCT ^16^12^ 

CTGACAAGATGTTCAYAGATAGCTCCAAA G AAATCTAAAAAACCATTATT Majority 
12410 12420 12430 12440 12450

rTrAr^^r^T^TTT-o^ T ^ G ^ TACCTCCAAAGAAATCTAAAAAACCATTATT 2603 al2. 5 eq 
12267 GTGAGAACATCTTCATAGATAGCTCCAAAGAAATCTAAAAAACCATTATT nemsle.ai^eq 

GACAGAAAAATGACTTCACCCATG-CTCTA A AACAATACTAGGTAAATGGT Majority 
12460 12470 12480 12490 12500

\tl\l ?!J A J AAAAA IJ AG r €ACCCATGGTCt ^ 2603 al2.sea 

12317 GACAGA-AAAATGACTTGACCCATGGTCTAAAACAATAGTAGGTAAATGGT nemsle^l^eq 

C A T T CTTTGCA AAAGAT ACCCCTTCTAGCCTTGTTAACTCAAAACCTGTA Majority 
12510 12520 12530 12540 12550

12367 GAt?c???cc£M ^ A ^^^n CTACCGTTGTTAACTGAAAACCTGTi 2603^12;seq 
12367 GATTCTTTGCAAAACATAGCCCTTCTAGCGTTGTTAACTGAAAACGTGTA ne^.l6_al27seq 

TTACAAATCA GAAAA TCAATATTTTCATCTGAAACATATTTCATCAGCGT Majority 
12560 12570 12580 12590 12600

ggj ITy^TTATlT^ 2603^i2.seq 
12417 TTACAAATCACAAAATCAATATTTTCATCTGAAACATATTTCATCAGCGT ne«316^al2^eq 

G T TC T ATTCTC CAT T-TT T GTTAATAATAGGATAGCGCTGCTTGACAATAT Majority 
12610 12620 12630 12640 12650

P 467 CTTGTATTCTCGATTTTTGTTAATAATAGGATAGCGCTGCTTGACAATAT 2603 al2 sea 
12467 CTTCTATTCTCGATTTTrGTTAATAATACGATAGCCCTGtTTGUAATAT ' ISlS^eq 

TTTTGC-TCGGTAA ACGCTAA.A T T TTTCTACCCTTC TCT T.C ATCTATAATC Majority 
12660 12670 12680 12690 12700

12517 T T T T G G T -C GGTAAACGGTAAATTTTTCTACCCTTGTCTTC A T.C T A T A A T C ne«316_al2.seq 
CCTAAATCAC CATC ATTAGTTGTT A C A A. T A A C A A C A C C CT A GCC'AC GCTT Malorltv 

12710 12720 12730 12740 12750



^I^ AA I^^^ TGATTAG ^ TGTTACAAT ^ ACAACACGbTAC CGACCCTT 2603 ai2.seq 
12567 GGTAAATCACCATGATTAGTTGTTACAATAACAACACGGTAGCCACGCTT ne«3l6_al?seq 

A A C C A A A T C T C CT G T C A T T T T A T C T G T A T A A C G T T C A A T A C C T C C G A C G A Majority 

12760 12770 12780 12790 12800

12617 AA CCAAATCTGCTGTCATTTTATCTGTATAACGTTCAATACCTCCGArri 5>firw 

12617 AACCAAATCTGCTG'TCAT-TTTATCTGTATAACGT^C^A^AC^T^C^aVgg^ ne^i6^a'i2?seq 

AGGGTAG AT A- A T A TCCTGAG AA-AACACCAAC.TGTT.TTTACCTTATTTTCC Majority 

12810 12820 12830 12840 12850

12667 AGGGTAG ATAATATCCTGAGAAAACAGCAACTCTT T T T AC C TTATTTTCC 2603 al2 sea 
12667 AG GGTA OA T A- A TAT C C T G AG A A A A C A G C A ACT G TT TT t A.C CT T A f f T X C C-SlaiO^, 

ATA T T TAT C C A C T T T C A T C A A T A A G C C A T - C T T T T A A G C C ^ .T .T A ~ A T C A T A- G Malorltv. 

12860 12870 12880 12890 12900

12717 A T A T TT A T C C A C T T X C AT C A AT A A G C C A T C T T T T A A G C q:tT T A A T C AT AG 2603 ai2 sVri 
.12717 AT A T T T A T C- C A C T T T C A T C- A* ATA A G C C A T C-T T T T A Afi-C T T. A A T C A T A G TO^16^12^eq 

CAACTATT TTTTtGCTC TTT T GC T C TTCTCCTACCAACACTCGAAC.AA AT Majority 

12910 12920 12930 12940 12950

12767 c A A C T A T T.T-T.TJ TCGT.CTTTT G C-T C T TC TiGCTACCAA'C ACTC GAACAAAt" 2603 ai2 sea 
.12767 CAACT ATTITTT.T'CCT'CTTTTCCT C.TTC T G CTACCAACA C.T C GAACAAAT' hfe^lQja^^Seq 

TCATTT ^GCATAA AT A CTAAATATTTGTATCGCTTCTTCTTACC AT ATTT Majority 
12960 12970 12980 12990 13000

i CA in^ GCATAAATACTAAATATTTGTA ^ CCCTT CTTCTTACCATATTT 2603^i2.fceq 
12817 TCATTTCGCATAAATACTAAATATTTGTATCCCTTCTTCTTACCATATTT nS.ai^eq 
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" " ™ 10 13020 13O30 13040 13050 

12867 TTTTATAATATAGATCCCATTCCGTATCATCTAATATTTTCGAAATGGTG nemSlG^alCeq 

AATCATTCAATACATGAAA AACATCCCCAAATTTTTTAACTCGTUAGAG Major 1 ty 
13060 13070 13080 13090 13100

12917 AATGATTCAATACATGAAAAACATGGCCAAATTTTTTAACTCGTGAAGAG 2603 a!2 seq 
12917 AATGATTCAATACATGAAAAACATGGCCAAATTTTTTAACTCGTGAAGAG ne^76_a 12 1seq 

TGTCCAATTTCGTGTAACAG ACCAATAAAATTAACCTGATAAGTCTTATA Majority 
13110 13120 13130 13140 13150

]fIL TGTCCAATTTCGTGTAACAGACCAATAAAATTAACCTGATAACTCTTAT A 2603 ai2.seq 
12967 TGTCC AATTTCGTGTAAGAGACC AATAAAATTAACCTGATAAGTCTTATA neo3t6_al2~seq 

TCCCATCTCTGACACACCA TAATTCAT T-T CAGAGTC. AACAAAATCAATAA Majority 
13160 13170 13180 13190 13200



^nil I^ GA I? TCT GACAGACGATAATTCATTTCAGAGTCAACAAAATCAATAA 2603__ai2.seq 
13017 TCCCATCTCTGACAGACGATAATTCATTTCAGAGTCAACAAAATCAATAA ne^lG.aiCeq 

ACATCTCTTCTCCAAAGC CACATGTTTCTTCGAAAACGCTCGTTTTCATT Maforitv 

13210 13220 13230 13240 13250

13067 A. C. A T-'C TCTTCTGCAAAGCCAGATCTTTCTT'CGAAAACGCTCGTTTTCATT 2603 ai2 seq 
13067 ACATCTCTTCTGCAAAGCCAGATCTTTCTTCGAAAACGCTCGTTTTCATT ne^Te^aiiTseq 

AAACC AGC€ C A A CT A AT AC A CTC TT CA ATT TC TTTA TA GTC A A A TTCTT G Majority 
13260 13270 13280 13290 13300



13117 AAAGCACCCGAAGTAATACACTCTTCAATTTCTTTATAGTCAAATTCTTG 2603_al2.seq

13U7 A A A G C AG CCGAAGTAATACACTC-TTCAATTTCTTTATAGTCAAATTCTTG m^Te^alCeq 

CATCACTAAATTTTCACCGT TCATATCTTCATACAAACAAGATAACATAC Majority 
13310 13320 13330 13340 13350

13 J67 C A T C A C-T A A A T T T TC A C G G T T C A T A T C T t G A T A C A A A C A A G A T A A C.A T AC 2603 ai2,seq 
13167 C A T C A C T A. A ATTTTCAC.G G T T C ATA TCTTGATACAAACAA GATA ACATAC ne^Te^all^eq 

CGACCTTAGGTAAATGAAGG TAATTTTCATAATTATCTATCAAATCACCT Majority 
13360 13370 13380 13390 13400

^ACCTTAGGTAAATGAAG.GTAATTTTCATAATTATCTATCA.AATCACCT 2603_.ai2.seq 
13217 CGACCTTAGGTAAATGAAGGTAATTTTCATAATTATCTATCAAATCACCT ne»316_al2^eq 

AGGACAACCGAATCTTG ATCTAAAGTCAAGAACCAATCAAATTCTTGTGC Majority 
13410 13420 13430 13440 13450

13267 AGCACAACCGAATCTTGAT'CTAAAGTCAAGAACCAATCAAATTCTTGTGC 2603_ai2.seq 
13267 A G G A C A A C C G A A T C T T G A T C T A A A G T C A A C A A C C.A A T C A A AT T C T T G T G C nem316_al 2. seq 

TACTGCAAATTGACCGATACAGTTCAAAGC ATATGC AATCCCTTTATTTT Majority 

13460 13470 13480 13490 13500

13317 T A C IC CA'A ATT.G ACCG ATAC ACT TC A A AGC.AT A'.TGC AATCCCTTTATTTT 2603^12 seq 
13317 T AC TG C.A A A T T G A C C G ATA G A G T t C.A -A A G.C ATA T G C A A T C C C T : t -T A T T TT neri3?6_ai2^ 

C T G t T A A AT A A T C A A C A G T T A G. G T G CC C C T CTT C A X T A T A A .T C G GCt AC X Majority . 

13510 13520 13530 13540 13550

13367 C T G T X AAA T A A T C A A CA G T T A CGI G C C C G TGTTCAtT A T A A TC C G C TACT 2603 ^iseq 
13367 C T G T T A A A T A A T C A A C A G T T A G G T G C O C C T C T T £ A T T A T A A T C G G C t A C T nenSlG^aii^eq 

A A T T G A G A A A T T T C T T C C T T A T T T T T C G A G C C A T T A T C T A C C A T. A T A G A T Majority 

13560 13570 13580 13590 13600

J 3 /}}* A.A T T.G A G A. A. A T T T C T T C C T T ATT T T T C G A G C CA T T A T C T A C G A T A T A G A T 2603_jU2:seq 
13417 AATTGAGAAATTTCTTCCTTATTTTTCGACCCATTATC.TACG ATA T A G A T nem316_Sl2^ . 

GTGGCTTACjrTGAGGATAA ATTGCTCGAATGTTCTGATCTAAGCGTTCAA Majority 
13610 13620 13630 13640 13650

13467 GTGGCTTACTTGAGGATAAATTGCTCGAATGTTCTGATCTAAGCGTTC A A 2603 ai2.seq 
13467 G T G G C T T A C T T G A G G A T A A A T T C C T C C A A T G T T C T G A T C T A A G C G T T C A A ne^Te^a i 2 seq 
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lL " i36€oO ii IU ..-tajfro ,«' IU I. :»* ^so 136 90 13700 

ran TaTtITITT^^ aeoa^aia.seq 

13517 TATTGCGGTTAAAGGTGACAATACCCGCTAAATATTTCATGTTCTATGCT ne*316._.al2^eq 



CTTTTCTAAAATCTCTAAATAACTCAATGACTGGTGCTTTGGTTATAAAA Majority
13710 13720 13730 13740 13750 



13567 CTTTTCTAAAATCTCTAAATAACTCAATCACTGGTGCTTTGGTTATAAAA nem316_ai2.se<j 

ACGATACCGACATAGATAC TTACTGCTACTAAACTTTGAATGACATAATT Majority 
13760 13770 13780 13790 13800

13617 ACGATACCGACATAGATAGTTACTGCTACTAAACTTTGAATGACATAATT 2603 al2 sea 
13617 ACGATACCGACATAGATAGTTACTCCTACTAAACTTTGAA?GACATXAT? ne^lS^iz^eq 

TACCAATGATACTGACATT TGAGTATTGATATAA T.A GACTAC ACCTCCAC Majority 

13810 13820 13830 13840 13850

TACPAATGATACTGACA T T T G A GTATTC ATATAATAGAGTACAGCTCCAC 2603 ai2 seq 
13667 TACCAATGATACTGACATTTGAGTATTGAT AT AAT'AGAGTAC A*G Cf C C AC ne^Te.ai^eq 

TA AG ACTAGCAGCAATTA AATACCGCAGCATTCCTCTTCTTAATTCTTTA Majority 
13860 13870 13880 13890 13900

13717 TAAGAGTAGCAGCAATTAAATAGCGCACCATTCCTCTTGTTAATTeTTTA 2603 ai2 sea 
13717 TA AG AGTAGC AGC'AATTA AATAGCGCAGGATTCCTCTTGTTAATTCTTTA 22lK& 

AAAGTAAATACATCTCTTA AAGAGATAGCTTGATATAGGGAGACAATAAA Maloritv 



13910 13920 13930 13940 13950



J 376 Z A A A 5 T A A A T ACATCTCTTAAAGAGATAG'CTTGATATAGGGAGACAATAAA 2603 ai2.seq 
.13767 A A A G T A A A T A C A T C T C T T A A A G A G A T A G C f T G A T A T A G G G A G A C A A T A A A nem376_ai2?seq 

T f C A C T A A T A A C T G T A G A G A T A A T A G C T C C C A T A G C A C C T A A A A T T G G T A Majority 

^ * 3 ? 60 - 13970 • 13980. ' ; 139 90 14000 

13 *} 7 TTCAGTAATAACTGTAGAGATAATAG C T C GCATAGC A C C T A A AATTGGTA 2603 ai2.seq 
13817 TTCAGTAATAACTGTAGAGATAATAGCTCCCATAGCACCTAAAATTGG T A neaalelail^eq 

;TTAAA AGTATATTAAGCACAACATTTGCCACAAGTCCAATAACTGCAGAC Majority 
, 14010 14020 14030 1 4040 * U050 

III- .33 AAAA ?I ATiATTAAGCACAACA TTTGCCACAAGTCCAATAACTGCAGAC 2603 aiZ.seq 
13867 TTAAAACTATATTAAGCACAACATTTGCCACAAGTCCAATAACTGCAGAC nemsTe.aiZ^eq 

ATTGTGTAAGCT TTTGTACGTCTTGAAGCC. AGTAGATACTGTCTCCCTAA Majority 
. 14060 14070 14080 14090 14100 

]f a \Z ATTGTGTA AGCTTTTGTACGTCTTGAAGCCAGTAGATACTGTCTCCeTAA 2603 ai2.seq 
139!7 ATTGTGTAAGCTTTTGTACGTCTTGAAGCC AG TAG AT ACT GTGTCCCTAA nemsTe.aittL, 

. AGCGTTACCATA AGAAATGCAAATGATCATCAAA Majority 

1411 0 . . • .14120 14130 

13967 A G.C GT.T.ACCATA AG A A A T G C A A A T G A T C A T C A A A 2603 ai2 see 

13967.AGCGTT ACC ATAAGAAATGC AAATG ATG ATCAAA - h^Tfe.aiSeg 
Decoration 'Decoration #1': Shade (with solid black) residues that differ from the Consensus.
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XjLC^G ^^ T C C r c r at C A C C A C T A A T T r. r. r. j^I^ ' 

l P 20 30 do 






— £_ T 30 40 so 

T C C A C A T C C rTrri^^T-^^^^^^^*^^^^^^ "f^C CAGTAATTCGC oohl al 2 sen 

TCCACATCGGTCCAATTAAC-ATATGACGTGGCGCATCACCAOTAATTCGG So^lL's^ 

TGAATAACAATATGTTTTGGAATAATCTCCA GTTGGTCACAAATAATCGA Majority 
7 P • «g SO 

TGAATAAC AATATGTTTTGC AATAATCTCCAC TTGGT CACAAATA A T f r A „ 

TGA ATAACAATATGTTTTGGAATAATCTCC AGTTGGTCACAAATAATCGA a909Zai2*s^ 

AATATAGTCTTCTTGACTTAACAAACGTAAAC GACCTTCATCGTAATCTC tt*,^ 
- : ™ *30 14 p ~Z 0 

A A T TlITZllT^^ cohl.al2.seq 
AA r ATAGTCTTCTTCACTTAACAAACCTAAACGACCTTCATCGTAATCTC a909_ai2.seq 

TCTGCATTCTTGTAT TAGTCA TAACATCCAGAACGTGTAATTTTATACCr Majority 
^ _ *?° '?0 190 200 

tTtTc^t^^ 

1CTGCATTCTTGTATTAGTCATAAGATGCAGAAGGTGTAATTTTATACCC a909„ai2.seq 
TGAATATCATTATCCGTAACACATCGACCAA CA TT.T TCCACCATCA T A T r. Majority 
_ 2 *° . ■ 2 ?° 230 240 

TGAATATC.ATTATCCCTAACACATCCACCA A CATTTTCCACCATCATATL h! ^ 
TGAATATCATTATCCGTAACACATCGACGAACATTT™^ 

ATCTGTCTCC G CT CGG AG AC CA TT.T A T T A. G G T G A. G A A A C G ATTTCTACTT VL^ r ^ 

■ • ^° 280 ... , 290 " 3^ 

A^GfGT^^ cohl.ai^.seq 
ATGTGTCTCCCCTGGGAG AC CATTTATTAGGTGACAAACG ATTTCTACTT a909_al2;seq 

TAGCAGCTAATTCTCGTATTCTCTTAACA G TT T T T T TCT AAAGCTC ATA.T Malnrlt v 

3 ?° 320 330 340 _ ~Z Q 

TAGG^CCTAATTCTCGTABt.CTCTTAA.CAGT TTTTTTGTA AAGGTCATAT rnh» ™ 
T A G C A.G C T A A T T. C T C G T A T T C T C T*T A A C A G T T T*T T T'T GTAAAGGTt AT A T a009Zal2[se^ 

G A ATGTGCTCTATTTAf TA ATGCAGAACTTGC TTC ATAAGTTGTCTGAAG Majority 

3 ?°. 3 T° . 380 39b ~~^o 

P A.A-T G T G C T C T A T T T A T T. A A T G C A G A A G T T C C T T C A T A A C T T r T r t r a a r ^u* 
GAATCTGCTCTATTTATTAATCCAGAAGTT'GCTTCATAAGTTCTC. TGAAG a909Zal2!seq 

CCC-TAATTCTA AAGT'CAC ATGCAT T*'C TTTCAGAAAGTTCAGCGAGATAGT Ha jorlty 

112 *?<» 440 4I0 

CCCTAATtC T'A AAGTCACATCCAT t'C TTTCAGAAACT Tplrrr'ir a t a <- t« u, ., 
GCCTAATTCTAAAGTGACATG.CATTCTTTCAGAAAGTTCAtiCGAGA^-AG'T "a909Zar2lseq 

ATATAGTTTCATCACGTAAGCAATCCGGCC.T TGTTCCCATGTTCATCCGG Majority 

■• ' - *f° ■ . *?° " ■ . W . ■ ■ 490 ~ ' 

A T A T A C T T t .C A T C A C G T A A G C A A T C C C C C C T T G T T C C G A T G T T C A T C C C C ™M 

A* A.T.ASTT T O A T C A W ti A G C.A A t,C C G G C C T T G T i C C G A T'G I T CAT CO C.G a^lf S 

■ A TAA C T CO TGGCTC At TAA T A G O C T G T T C C T A A C G C T O TT T A A TT A T C T f! ' fainr!^ 

^ ; ■ ;- 530 . - . >v ~ ■:■ ' 

* T A * C T OC fGG CT C A T T A A T A G C C T C T T C G T A A C G C T C T T T A A TT A T C T C c«hl »17 

A T A A C T C C T<5 G OT C A T T A AT A G C C T GT T CO T A AC G C T C ll.J AATTATCTC So^alt^ 

.TAACTTAGCATGGCTATTGGTAAA.ATTTTG AAAATAGACTAAGTATTTAT Hajorlty 
560 . £7ft xttn r-~„ IT" - - 



- . ■ ■ S ?° ^80 590 • 600 

^'!SH!S5 A;TG(;<; ' TnT ^ TAA.AATTTTGAAAATAGAc1a.AGTA T T TA T cohi ai2 sea 
TAA CTTAGCATG G G T A T T-G G'T A A A A *T T"T-T G A A A A T A C A C TA AGTAtTTA.T a^ta!^ 

TAA CC TC.AGG'CC ACTTTCTATGC ATGAAA TCA ATTT CT-TTATAGAATTGT- Majority 

32 630 640 ^ 

TAACCTCAGGCCACTTTCTATGCATGAAATCAATTT CTTTATAGAATTGT mhi ™ 
TAA'CCTCAGGCC ACTTTCTATGC ATGAAATC A ATTTCTTTATAGAATTGT a909Z^l2lseq 



• '"/4«7 
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XCAC ^g^C jajg ^jgj KM TAGCATCCCCTCAACCAG A A A C Ma,o,-,. v 

6 f° 670 680 690 700 

651 TCACCAATAGGAGCTTCTGGAGCAACTATAGCATCCCCTGAACCAGAAAC a909_ai2.seq



TGTCCAAAAAGTCCACCCTCCTCTACCAAC 



7 }° 720 730 740 



CTGTTCCATCTCTGTTA GG AC Majority 



750 



™* T.GTG.CAAAAAGTGCACCCTCCTCTAGCAACTGTTCCATCTCTGTTAGCAC cohl ai2 sea 
701 TGTGCAAAAAGTGCACCCTCCTCTAGCAACTGTTCCATCTCTGTT.AGGAC SS^Si 

.AG-TCAAAACCAGCATC. TATAGG TAATTTAAATATTTTTTCTCCAAAGAGT Malorltv 
• J^O 770 780 790 800 

751 AGTCAAAACCAGCATCTATAGGTAATTTAAATATTTTTTCTCCAAAGAGT a909_ai2.seq






TCTCCATAATAATCATTAA T CGCACCATAACGTTTTTTCATACGATAATT Majority 
. ' 8 j: Q 820 830 840. _ 850 

^^^rM TAATCATTAATCGCACGATAACGTTTTTTCATA GGATAATTcohl ai2. 
TCTCGATAATAATCArTAATCGCACGATAACGTTTTTTCATAGGATAATT a909Zai2; 



GTATCACAA TTTTAAC TAAAATAACCTCACTACTACAATAAAACTAAAAA Majority 
S( p ' 8J0 880 890 900 

ffj "J J A J H A « A ^IIli AA ?I AAAA I AACCTC A C TACTACAATAAAACTAAAAA cohI_ a i2_ 

.seq 



r t i t 1 p V 4 -r ^« -r <r • V « ^ ? n t\ is \s 1 a. 1 a 1 A u A A 1 A A A A C -1 A A A A A cohI_ai2_seq 

851 GTATCACAATTTTAACTAAAATAACCTCACTACTACAATAAAACTAAAAA a909_ai2.seq



A GATTGGAACCTC AG T"T A GTTC C AATCTTTT ATTTA CT TCACTTTCTTTA Majority 

. ^ ^ 930 940 950 

fOJ A ^ A IJ G 5 AA ^TCAGTTAGTnCCAATCTTTTATTTACTTCAGTTTCTTTA cohl ai2.seq 
901 AGATTGGAACGTCAG.TTA GTX CC AATCTTTTATTTACTTCAC TTTCTTTA- 3909^12^^ 

A CG A- ATCCTTGGCTAAAAA GAT. AT ACGCA.GTTAG ATT CAAAATACCATAA Majority 

... • . ,. Q?° 970 . 980 990 1000 

If? A S 5 A f 3 C C T * G 6Blf A A A A A G A T A T 'A C G C A G T T A G A T T C A.A A A T A € C A T A A cohl- ai2.seq 
951 AC.CAATCC.TTGGCTAAAAAGATA. TACGCAGTTAGATTCAAAATACCATAA a909lai2.seq 

GCAAC.TA.TAAAACCAGCT AAAACATC.TGTCGGAAAATGAACCCCTAGGTA Majority 
; • l0 }° 1020 1030 1040 1050 



1001 GCAAGTATAAAACCAGCTAAAACATCTGTCGGAAAATGAACCCCTAGGTA cohl_ai2.seq
1001 GCAAGTATAAAACCAGCTAAAACATCTGTCGGAAAATGAACCCCTAGGTA a909_ai2.seq



AATACGAGATAACCCAAT TAAAAAAATGAGCAAACCCAATGTACCTTGGC Majority 
. 1Q70 1080 1090 1100 

A A T A C c A G A T A A C C C A A T T A A A A A A A T G A G C A.A A C C C A A T G T A C C T T G G C cohl ai2.seq 
1051 AATACGAG AT AACCC A A T T A AAA AAATGAGCAAACCCA aQQ T A CCTTGGC a909.al2.seq 



A C A A C A G T T T C C A T A T A C T C T T A G G C A T A T A G T A G T G C A A T A A A A T A A T A Majority 

- H*° 1^0 1130 1140 1150 

1101 A C A A CA-'G, T T T CC.A TATA C ~T C TTAGGC AT AT A G T A C T G C A A T A A A A T A A T A cohl aiB.seq 
. lit*! A C A A C A G TT T C C A T A T A C TC t-T A G .G C A T. A T A G T A C T C CfA A T A A A ATA A T A .a90?Zai2^. 

C T A C T CC C A A AT AT- C.A T A A A T G T t C C C A TCCA G T G C CCA Ct C C & A A A CCA MaWltv 

- - fjiSL: : H 8Q . - • 1190 '• 1200 : 

utl M v i £ i n r £ * * * T A J £ A T > A A T G T T ^ A T C G A C T G C -C C A C t G G G A A A C G A cohl_ai2.s^q 
US* B T A C T C C .C A A A T A T C A T A A A T G T T C C C A T C G A G T G C C C A C T G G G A A A C G A a909_al2.seq 

- . A T A G C C A C C t G C A A" A T A C T A A A T G CG TTA A A C T T G G T CT C A C T C T T T G AA Majority 

Jgg . *220 1230 1 2 40 • 12.0 

I20i A TAG C CACCTGCAA AT'ACTA A A TG G GTT A A A G T T G G TC T C A C T C T T T G A A cohl aiZ.seq 
1201 AT ACC'C A.CCTGC AA'ATA C T AA.AT GCGTTAA AGTTGGTCTCACTCTTTGAA 'a909lal2. S eq 

A A A T A AG TTTTA AA G A A AGTAT.AC ATA.TA CC AGAGATA ATA GCA TTT A CT Majority 

. *260 . 1270 . 1280 1290 1300 

1251 AAATAAGTTTTAAAGAAAGTATA C"A TATACCAGAGATAATAGCATTTACT cohl aI2.seq 
1251 AAATAAGTTTTAAAGA A. A GTATACATATACCAGAG AT AATAGCATTTACT a909_al2.seq 
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n roetiwJTOth Weighted residue weight table. 



WO 2006/078318  method with Weighted residue weight table.  PCT/US2005/027239



C C G A 



¥ tf IV Qj ^jjPg ?g!9 TCTTAAGCTA AC A .^, \ G T G A C * JoTT^ 
13 lO i^n i««n 



13 . 10 1320 1330 1340 1350 

GCGATAAATCTAGCTTGAGGATACCACTTCTTAAGGTAACAGAAAGTGAC cohl al2 seq 
GCCATAAATCTACCTTGAGGATACCACTTCTTAAGGTAACAGAAAGTGAC aSOO^aiaiseq 

GCTC ATA ATCGC A ATAGC TATCTGGCTTAC ACTATTACCAACCAC ACTG A Majority 
1370 1380 1390 1400 

GCTCATAATCGCAATAGCTATCTGGCTTACAG T-A TTACCAAOCACAG.TGA cohl a!2 seq 
GCTCATAATCGCAATAGCTATCTGGCTTACAGTATTACCAACCACAGTGA a909lal2.seq 

TTAACTTGAAAA ATCTTGTACAAAGATTTCGCAACTGTCCTCTAACACTT Majority 
1*10 W20 143Q 14 40 1450 

T T A A C T TH A AAAATCTTGTAGAAAGATTTGGCAACTGTCCTCTAACACT T cohl_ai2 seq 
TTAACTTGAAAA A.T CTTGTAGAAAGATTTGGCAACTGTCCTCTAACACTT a909_ai2.seq 

TCTTGAATAGTT TGGTCAAATGCGATTA.CA.G.TGT.CGGGCCAATATTTGAT Majority 
MSG 1_£0 1480 1490 1500 

- — TCTTG AATAGTTTGGTC AA ATGCGATT ACAGTGTCGGGCCAATATTTGA T cohl a 12. seq 
1451 TCTTGAATAGTTTGGTCAAATGCGATTACAGTGTCGGGCCAATATTTGAT a909_.ai2.seq 

G A CCAATCCTAAACTGAAAA.ATAAGATAATAGCAA T.A A A TGCTTGAATAA Majority 
1S10 1520 1530 1540 . 1550 






CACCAATCCTAAACTCAAAAATAAGATAATAGCAATAAATGCTTGAATAA cbhl_al2.seq 
GACCAATCCTAAACTGAAAAATA A.G ATAATAGCAATAAATGCTTGAATAA a909_al2.seq 

G T T T A C T A T T T T G A C G A G A T A A G A T T A G T C T T T T T A T A f C T T T C T A A T A T Majority 
• 1560 1570 1580 1590 1600 

GTTTACTATTTTGACGAGATAACATTAGTCTTTTTATATCXTTCTAATAT cohl ai2.seq 
GTTTACTATTTTGACGAGATAACATTAGTCTTTTTATATCTTTCTAATAT a909_ai2.se^ 

TGGCAAACAAGC CACCTAAGTTAGATAGAAAACAATGGAAATTAAAATTC Majority 
1610 1620 1630 1640 



1650 



1601 TGGCAAACAAGCCACGTAAGTTAGA.T.A G A A A'A C A AT C C A A A.T T A A A A T T C cohl_al2:seq 
,1601 TG G C A A A C A A G C C AC GT A A G T T A G AT A G A A A A C A AT C G A A ATT A A A ATTC a909_ai2.seq 

CCTCAACGA T A TTAAATGGAATAACCATTGTTAAAAGGTAATTGCCTA C-A Majority 






_____ 1660 1670 1680 1690 1700 

CC'TC AACGATATTAAAT CCA ATA ACCATTCTT AAA A OCT A AITCCCT AC A cohlL_al2 seq 
CCTCAACGATATTAAATGGA'ATAACCATTGTTAAAAGGTAATTGCCTACA a909_ai2.seq 

CCAATAAATCTT CTCATATCAAACTTACCAAATATAflCATACAAACCAAT Majority 
1710 1720 1730 1740 1750 

C C A A T A AATGTTCTGATATCAAAGTTAGCAAATATAGCATACAAAGGAAT cohl.ai2.seq 
CCAATAAATGTTCTGATATCAAAGTTAGCAAATATAGCATACAAAGGAAT a909_al2.seq 

C G C A A AG A C A T A G T T C A G A G C T A C C A T A G A T A CA G T C A A CCTAACTGTAC Majority 
. . . t760 '1770 . 17 80 1790 1800 

.1751 C G C A A A 0 A C A T A C T T C A G A G C -T A C C A T A G A T A C A G T C A A G C T A A C T G T A C cohl ai2.seq 
175J C,G C A A A G A C A T A G TT G. \M -A 0 C T A C C A j A G A T A OA G T C A A G C T A A C T G T A G a9G?^i2.2q 

G A A A T A A A C T A G C T TT. A A TJjC A A A T'C T T T T C C A C T C T.C T C T.A TTT!TT CC A G Majority . 

.. . 181Q \ 1820 . . 1830 . 1840 1850 

1801 -C A A A.T A AA C T A G C T TT A A T A A A A T C T T T T G C A C T C TCTCT AT TT TTCCAG cohl_jal2 sea 
4801 C A A A T A A A C T A G C T T T AA T A. A A A T G T T T TG C ACTC TC TCT AT TT TTCCAG a909_ai2."seq. 

. A A A A f A G C G A A A C T T G C T A A A A A T A.G AG C T A G A CC A A C C A T ATTCATCC G Maloritv 

I860 " . 1870 . 1880 1890- 1900 

1851 A A A AT A CC GAAACTT G. C. T A A .A -A -A T A G A G C T A G A G C AAC CAT "ATTC AT C.G G cohi_ai2 seq 
1851 A A A A T.A G C G A A A C T T G C T A A A A A T A G A G C T A G A G C A A C C A T A t T CA T C G G a909_ai2.seq 

. TAAACCCATAAA GGTTTCTGGACCACGATTAGCAAGTATAACTTTTAAAA Majority 
. ' 191<> 1920 1930 1940 1950 

TAAACCGATAAAGGTTTCTGGACCACGATTAGCAAGTATAACTTTTAAAA cohl_;ai2.seq 
T A A A C C G A T A A A G G T T T C T G G A C C A C G A T T A G C A A G T A T A A C T T T T A A A A a909_ai2.seq 






FIGURE 2{b 
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GTGk T P 5 1 V y SftCS^S M5S?y T ? AT TTCAAATCAAATAAAATA , Majority 
1970 1980 1990 . 2000 

!o*I r G ^ AT CTTAATAACAGTACACCATAACTTGATTTCAAATCAAATAAAATAcohl aiZ.seq 
1951 CTGATCTTAATAACAGTACACCATAACTTGATTTCA A-A TCAAATAAAATA aSOoIaiZ.seq 

AAAGCAACTA.ACATCGG AAGGATTGAAAAATCAAGCTTTAAAAATTCTGC Majority 
' 2010 2020 2030 2040 2050 

AAAGC.AACTAACATCGGAAGGATTGAAAAATCAACCTTTAAAAATTCTGC cohI_ai2 seq 
AAAGCAACTAACATCCGAAGGATTGAAAAATCAACCTTTAAAAATTCTGC a909_ai2.seq 

- . TCCTGGTATTAATGGAAATG AAACCATCATCAATACAAAAG AT A A G G C A C Majority 

2 °?° f°™ 2080 2090 2100 

^f! iCCTGCTATTAAT : CGAAAT€A.AACCATCATCAATACAAAA.GATAAGGCAGcohl al2.seq 
2051 TCCTGGTATTAATGGAAATGAAACCATCATCAATACAAAAGATAAGGCAG a909lal2.seq 

A A A C AATGGC GATTGTCA CCATTTTACGTGTATTTGTCATAAAAAAATTC Majority 

_ 2 * iQ 2120 2130 2140 2150 

2101 A\A AGAATGGCGATX G T C A C C A T T TT A C G T G T-A T T T G T C A T A AAAAAATTC cohl_ai2.s 






2101 AAA G A A TGG C GAT.TGTC ACCATTTT AC GTGTATTTGTCATAAA AAA A T T G a909_al2.^ 

C T C C A A TTT AAATAAATTCAAAGAAGCTCCAAAGGTAAGCCTAGCTACGC Majority 

- • 21 t 6Q 2170 2180 2190 2200 

2151 CTCC AATTTA AAT.AAATTGAAAGAAGCTCCAAAG.GTAAGCGTAGGTACGC cohl_ai2 .seq 
2151 CTCCAATTTAAATAAATTGAAAGAAGCTCCAAAGGTAAGCGTAGGTACGC a909_al2.seq 

C A A A A A A.AC C TTT GTCT TCTCC C'ATC CA GA G' T T T A C T G T C"G G T T G. T G G A A Majority 
2210 . 2220 2230 2240 2250 



2201 G A A A A A A A C C TTTGTCTTCTCCCATCC A GACT.TTACTGTCGGTTGTGGAA cohl._ai2.seq 
2201 GAAAAAAACCTTTGTCTTCT CCC AT CCAGACTTTA C T G TCGGTTGTGGAA a909.jai2.seq 

T C T C A C C A C A T C A G C T T T C G C T C G C G G A C T G A T G C T T C A C A A C T G A C A A A Majority 
. 226Q 2270 , 2280 2290 2300 

.2251 ?CT C A C C A C A T C A GG f T T G G CT C G C G G A C T G A T 0 C T T C A C A A C T G AC A A A cohl_jal2 seq 
22S1 T C T C A C C A C A T C A G C T T T C .G C T C G C G G A C f G A T G C T T C A C A A C T G A C A A A a909_al2.seq 

TAAGTTGCAAG GCATTACCCCCGGTCGGGA ATT A C A C C C T G C C C T C A AG A Majority 

23tt> 2320 2330 . 2340 2350 

2301 "T-A A G T T GGAAGCGATTACCGCCGGTCGGGAATTACACC C T GCCCTGAAGA cotil_aI2 seq 
230I XAAGTTGGAAGCGATT. A C C CCC C CTCGCC A ATTAG AC CCTCCCCTC A AC A aOOO.a^seq 

CACCTATAGCA TAACAAAAAAAACTTGCAATTGCAAGTTTTTTAATCACT Majority 
■ 2360 2370 2380 2390 2400 

23S1 CACCTATAGCATAACAAAAAAAACTTGCAATTGCAAGTTTTTTAATCACT cohl_ai2.seq 
2351 CACCTATAGCATAACAAAAAAAACTTGCAATTG.CAA G T TT TT T A A.TC ACT a909__ai2.seq 

AATTAGTAGTAGATTGTATAATATTAATTTTTAACATCAATTAATTGAC.A Majority 
' • 241Q 2420 .2430 2440 . 2450 

2401 A A T T A G T A-G T A G A T T G T A T A A T A T T A A T ttjt A A C A T C A A T T A A T T. G A : O A ' c6hl_al 2 . seq • 
2401 A A T T A G T A G T A G A T T G T A T-<AA T A T TA AT TfTT A AC AT C A ATT A, A TT G A C A a909_ki2.seq 

G C G € A C -T A A T. A C T C T A C C T A C T C C T G C C T- T T.C T A C A A G T A A A C A A C C T T A Maidrltv • 
• - 2 ^_L Q ' * 24 7° - 2480 •■."■*; 2490 2500 



24£>1 --GC G* C ACTA AT AGTCT A GC T AC T.CCTGCCTTTGTACAAGTAAACAAGCTT A * cbM^i2,seti 
.2451 G-C.G CACTAATACTCTA GC TACTCCTGCCTtTGTACA AGTA-AACAAGCTTA a909_ai2.se*. 

A G T C C C A A T G A T T G T C T C A TG T G G C A G T T T TA T A A A C TTTTT C A A T C G C T Majority 

* 2s t t° / 2520 ; 2530 • ; 2540 2550 

2501 AGTCCCAATC ATT-GTCT'G A T G TG G C A G T T T T A T A A A C T T.T TTCAATCGCT cohi_a!2.seq 
2501 -ACT CCC A.ATCATTGTCTG ATGTG G C A G T. T T T A T A A A -C. T T T T T C A A T C GCT a909_al2.seq 

GTTGGTTCAATA ATTTCTCTATTACTGATTTTGTAGTGATAG AT TTGCCC Majority 
. ' 2560 2570 2580 2590 260Q 

2551 CTTGGtTCAATAATTTCTCTATTACTGATTTTGTAGTGATAGATTTGCCC cohl„a!2 seq 
2551 CTTGGTTCAATAATTTCTCTATTACTGATTTTGTAGTGATACATTTGC.CC a909_al2.seq 
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LX Plfe S fT ' G f ffl | T " A . TiF^"? CCCATATCTACAT TTTTTAAAC 

2610 2620 * 



T G T 






2640 2650 



TgTtITa^ ai2.seo 
TGTTGTAGTTGTAAAATAAACATCCGTTCCCATATCTACATTTTTTAAAG aQOflTalZ^j 

CATCAAAATGATAAGGAAAATTATGCGCAC A AATCACCATGTTATTAGTT Majority 
_ 26 . 6 ° fj« 263Q 2690 - ' 

fell cllJYYtY^ 

2651 CATCAAAATGATAAGGAAAATTATGC GCACAAATCACCATGTTATTA GTT agOsCaiZ-seq 

AAATAAGAACCATA ATACC TtCTAGGCGTTTTAGACACTTGTTCAAAACT Majority 
' 27 . 1Q 2 ?° 2730 2740 2750 

270I 1 AAATAAGAACCATA^T Arr^ TGTA< * G ^ G T ^ T T A.CAC A CTTG.TTCA AA ACT cohl alZ.seq. 
2701 AAATAAGAACCAT. AATACCTTGTA-GCCGTTTTAGACAGTTGTTCAAAACT a909lai2.seq 

ATAATTAGCAGCTACCGGTAAATGCAGTTT T AAGTTCCGAATATCCAGAG Majority 
2770 278Q 2790- 2^00 

ATAATTACCACCT^ cohI_aI2,sea 
ATAATTAGCACCTACCGGTAAATGCAGTTTTAAGTTCGGAATATCCAGAG a909_ai2.seq 

TTCCCA ACTA ATCTGTTTTATCC A ACTTTTTTAC ACCTA AT TCTCC ATTT Majority 
. ™?° . 283Q 2840 2850 

TTC CC A ACTA ATPTf T T ttat^^^^^^^^^ ^ ^ ^ ^ ^ C ^"^ A ATTCTCCATTT cohl_al2. Sfeq 
CCA AGTAATCTGTTTTATCCAAC TTTT T T AC ACCTA ATTCTC CATTT a909Zal2,seq 

TC. TGAACCCT TTAC TTC ATGCGTAATAC AT- T TATCA ACCGC C TT C A C A A T Majority 
. 2860 2870 ' nsLtn ZZZ ITT- 






. 1 ; . ■ — t - 2890 2900 

tTtT^ coM-.aI2.sea 
ITGAACCCTTTACTTGATGCGTAATAGATTT.ATCAACCGC CTTG AC A A T a909_al2,seq 

-A TGCTGAGAACTTA A AT CAC C TT C A T C C C C C T G A T T A A T.A T T A TA.CCAAC Majority 
/ 29 . IQ *f 0 293Q, 29 40 ■ 

"a tH C T C AGAAGT T A a t r a r V «r'/» ATGCC.CCTGAT T.A ATATTATACC'AAC CQhl_ai2. > seq 
2901 A THC T G A G A A G T. T A A A T C A G C T T G A T G C G C OTG A.T T.A A T A T T A T A C C A A'C a909lal2.seq 

CCCAATAG ATTCC AGA ACTTAC CAGA ATG A T TCCC AGTATC GCTA A A A A A Majority 

■ 29 , 6 ° -gyO 298Q 2990 30^0 

•CCCAAT'ACA TT CCACAAC T T A C C A G A A TKl A TTC CCAKTA tH r r t a a a a a ' a u. 
2051 XC'C A AT A G A,TTC'C AG A A'CTT ACC A G A A T G A TTC cQ A G T A T^ CCTAAAAAA a909^ai2^seq 

TTTGCTCAATAT CTTCTAA TCACGTCTTCTTCTCCATTTTAAGGCTATTA Majority 

^ f 0 ^ 3Q30 3040 3^0 

- 001 ^'TTCCTGAATAT'CTTCTAATC'AC CTCTTCTTCTCC A TTTTA aR I r f t a t t a mm 
■ 3001 TTTGCTTA atati-t't^i-t-a i ^ ^ 1 ^ ^ _ ^ * . 1 AAA AfjJ I* L 1 A T T A cohl_ai2.seq 

■auui TIT. GCTGAATATCTTCTAATGACGTCTTCTTCTCCATTTTAAGGCTATTA a909_al2.seq 

T t A A A C A C A G A A G T C C T G A C A T A A T T A G T A T A G G T A T t G G C C A C C A T A C T Majority 

' 3yby ' SOTO' , 3080 , 3090 3100 

T'TAAiCA.CA C; A A GTCCTGACATAATTAGIA T-A OCtiTTPrrr a ■/* r» a t * i • ... 
3051, TT A A AC AC A G-A A,TCCT,A CAT A A j T AG J A T A GG I AT T G 0C C A C b A £a 0 T SKES^.- 

T G T CCA G T A A A C G G E A C C TT.TC C C T TT G T C T G AT G T C T f A CTCT AG G A 6 t' Mal^rYrv 
~ ■ 311Q:. ^ . .^3120 ; 3 13Q 3 140 . 31S0 '. 

t l0t TGTC C A G T A A A C GGB A GCTTTCCCTTTG TC T G A T <? T G T T A C T G T A G G A G T ™hl »i? ^ 
3101 T G T C C A G T A A AC G-GG AGCTTTCC.CTTT ^ T ^ A T G T 6Qt AC'TG T A C G ACT a909Zal2*seq 

A ATTCTCTCTT C-T T T T TT A G fi T T TA.G C AT T T A A A G G GC T C A T T T-T C'T C A A M,Wir v 

: ^ 3l 7° • . 3190 . 3200 

a* a ^ I ^ T T C T T ^ T TT A G G. T T T A G C A T T T A A A C C G C T C A T T T T C T C A A cohl_al2 sea 

A A TBI G T C T C T.T C T T TT T T A G G TT T A G C A T T T A A- A G G G C T C A T T T.T C TC A"A. a909Tal2!^^| * 

ATCCTCTAATATCCTACTTCCC'ATCCTTAC f, TAfTGATAGTATAAAGCGA Majority 
32 .'° 322P 3230 3240 , 3250 

•aJ^^3 AATATCGTACTT ^ CCATCCT T AG GTATTGATAGTATAAAGGGA cohl al2 sea 
ATGCTGTAATATCGTACTTCCCATCCTTAGGTATTGATAGTATAAAGGGA 



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3251 



C A C A A EBB *T r 3i:*i ffit B3 U -TTVT^ T G A A T A A A T A G A T A A A T Majority 

3260 3270 3280 3290 3300 

CACATTAGTTCATAACCTTCAG Cg G TTTTAGTCTCAATAA AT ACATA AAT ™hi ^ ™ 
CACATTACTTCATAACCTTGAGCTGTTTTACTCTG^ 

CCCTTGA GGA ACATTCTTCGC AAC A AT A CCTTC AGCCCGTAAATTATCAA Majority 
3310 ' .w.n — ,~ ~7T_ 



, 3^30 3340 3350 

3301 G CCTTGAGGAAGATTGTTCCCAACAATACCTTCAGC CCGTAAATTATCAA cohl ai2 sea 
CCCTTGAGGAAGATTCTTCCCAACAATACCTTCAGCCG.GTAAAT^UUa«:S 

ACCTTTGTAAAGGTTGACTTTTATGAACA GCTTT'TGTTACTACATTGACC Majority 
. 3360 . 3370 3380 • 3390 . 3400 

aITtTTITITaTg^ cohl^i2.se q 
ACGTTTGTAAA CGI T G AHt TTTATC A ACACCTTTTCTT A6TAUTTGA CD a909_al2.seq 

TATTTGGCTTGCTTACTATCA A GGTTTACTTGTGTT.AGATCATCGTCTTT Majority 

™™ 3430 344 0 3450 

TATTTGGCTTGGTTACTATCAAGGTTTACTTGTGTTAGATCATCGTCTTT ,nhi ^ _ 
TATTTCCdTTCCTT ACTATC A A.G CTTTACTTGTCTT AQ ATC ATCGTCTTT a909^a£2"seq 

T A T T C C A A T A C C T T G A A A T G C G G T A C T T A G A G T A A A A A C T T G C T T A C C A T Major! ty 
J 3460 3470 3480 3490 3500 

^ii^M TACtTTGAAAT ^ CGGTACTTA ^ AGTAAAA ACTT GOT T.A C C A T cohOU2 seq 
TATTCC.AATACCTTGAAATGGGGTAGTTA-GAGTAAAAACTT G G TTACCAT Somali 

C A A C A T C T T T A G C T T G f G C T A C T T G G T. A A A C A A G T A A A T T A C C G C C A G C C Malorl tv 
**}° 3520 353Q 3540 3550 

3501 CAACATCTTTAGCTTGTGCTACTTGGTAAACAAG T'A A A TT ACCGCCAGCG aQOglailse^ 
' A ' T ACCTTG ATT ATT A TACT TAT T.TTGT AT ACT A AT AC A A CC CCTTTTC AT Malor-lty 
. -3560 3570 3580 3590 ^3600 

3551 ATACCTTGATTATTATACTTATTTTGTATAGTAATAGAACCCGTTTTCAT a909_ai2.seq

C T G A T C A T T G G T.A T C A C C A^G A C A C A A G T T C A G T A C T T A C A C .T A A A T A A T A Majority 

. . 3610 3620 ' 3630 , 3640 3650 

3601 CTGATCATTGGTATCAGCAGACACAAGTTGAGTACT TAGACTAAATAATa " ™hl* ^9 

3601 ctgatcatt--ggtatcagcagacacaagtt.gag.tac"aga" 

AGAGAAGAGTTATCTTTAG GATCTTTTTATAAATCATTCTTCTCTTCCTT Majority 

, gggg -S^ 0 3680 3690 3700 

3651 AGAGAAGAGTTATCTTTAGGATC TTTTTATAAATCATTGTTCTfT*rr i-tt ^ki «o 
3651 AGAGAAGAGTTA tJH;T TTAGG ATGTT TT^AT^.£at^A^T^T^C^"CTTC CTT a90^Zai2!seq 

TCTCATTGCTTGTTTTAAAAT T TTCTTACG.TTGACGTGCTC.TCCTAGTTA Majority 
37 .™ , -3720 - 3730 ' 37,40 3^0 

3 I 0i TCTC A T T G - C T T $ T T T T.A A A AT TTTC T.f A C. p T TGACCT G G T C T C C T AGTTA cohl ai2 sao 
3701 TC T C A T X OCT T G T T-XT A. A A ATT-TJ € ;t T A C G T T G A C.^GCT C'TC C T A 15 i t^t^ 

C T T C T A A A G A G A T T A A A- A* C T 'A AAA T C A A ACT A A C C A A. A 'A T A C' C G A T- A A A' T Majority 

' / ^P 0 /■ M?* \ a ; ;378Q; . ' 3?90 '.. . .3800. ' ' 

C TT C.TAAAGAGAT T.A, A A A.G T A A A AT CAAAGTAA CC A A A AT A GCGATAAA T cohl alZ^ea 
C T-T C T A A AQ A G A-T T A. A A ACT A A A -A'T CAAAGTAAGGAAA A-T. AGCGATAA AT a90^Zai2^seq 

GGTGC. GATATAAATAGGCT-CTATTTGTAT T -GCCTCTGCTACTACCAAAGC Majority 
3810 - . . 3820 - 3830* - ivAtx 






; • 38 . 2 0 - , 3830' . 3840 3850. 

C Gtr ftTl * I t A M t 5'5 C I SI A I ^ T G T A T i'6 C C T C-T G ; C TACTACC A-A A G C cohl^iE.seq 
GGTGCGATATAAATAGGCTCTATTTGT A'T T. G C C TCTC C T A cflA CCAAAGC a909_ai2;sfeq 

C T T A C C A T T A T C G T T T G G T A CACGATGT C CTCfCACTAGTAACCGATGGG Majority 
' . - 3 y° 3870 3880 3890 . 3900 

^ T If5jf TTATCGTTTGCTicACCATGTCCT CTCACTAGTAACCGATGCG cohl^ai2 sea 
CTTACCAT-TATCGTTTGGTACACGA TGTCCTCTCACTAGTAACCC ATGGG 3909^12^ 
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T A T T Ajfr gX"ft,A' f mm f =fe Ty jPfcaSWa «b C AAACTTTCCTACTCTTTACCT MaToTTTT 
39*0 3920 3930 3940 3950 






T A T T A A C G C C A T A T G G T G T A C A C G T C A C C A A A C T T T G G T A C T C T T T A C C T cohl a i2 sea 
T A T T A A COC C A T ATC GT C T A C A C C T C A C C A A A GTTTG G T A C T C T TT A C CT aOOsIaiZ^seq 

TTAAC A ATTTCTAAATCC C TCAAATCATCCCCTTTAACTCTTCTGATTTC Majority 
39 t 6Q 3970 398Q 3 990 4^00 

Sf! JI^ CAATT TGTAAATCCCTCAAATCATCCCGTTTAACTCTTCTCATTTG cohl ai2.*eo 
3951 TTAACAATTTGTAAATCCCTCAAATCATCCGGTTTAACTGTTCTCATTTG a909lal2.seq 

AT CCA CTTCATAAGTATAT GTTTC ATTTAA G ATACTG ACTGT CCAGTGCT Majority 
- ; . 4 °. t0 4020 4030 4040 4050 

•5221 !I^ CACTTCATUGTATATCTTTCATTTAAGATA CTCACTGTCCAGTGGT ^ al2.seq 
40OI ATCCACTTGATAAGTATATGTTTCATTTAAGATACTGACTGTCCAQTGGT aSOsIalZ.seq 

CTCCAGCTTTTAACTTATC CAAATCAGAAA A A AG CC TTC A A C A G GC T A A A Maloritv 
4( > 6 0 4070, 4080 4090 4100 

4051 CTCCABC-TTTTAACTTATCCAA AT CACAAAAAAGCCTTGAAGACGGTAAA cohl ai2.seq 
4051 C T C C A C C T T T T A A C T T A T C C A A A T. C A G A A A A A A G C C T t C A A C A G G C T A A A a909lal 2 :^ 

CCTCTATGTCCTGATAAAATAGAATG. AGTTGAGTCTCCTCCA, ATTGGAAG Maloritv 

______ 41 . 20 4130 4140 4150 

4101 « C T CTATGTCCT GAT A AA AT AOA AIC AC TTC A CTC t CCIC C A ATTGGAAG cohl al2.seq 
4101 . C C T.C T A T G T C C T G A T A A A A T A G A A T G A G T T G A G T C T C C T C C A A T T GG A A G a909Zal2.seq 

ACTA CT T C CT TCTAAATGACC A A T A C A ACTTTC A AC C A C TTTTTCACTTG Majority 

4 \ G0 - 4170 4180 4190 4200 

4151 A C T A C T f C C T T C T A A A T G A G C A A T A G A A G t T T G A A G C A C T T T T T C A C T T G cohl al2.se* 
4151 ACTACTTCCTTGTAAATGACCAATAGAAGT TT.G A A G C A c|I T T TT C A C T T G a9oCal2.seq_ 

■ T A C C A T G A T A A A G T G G T A A T T T T A T G T T T A T C T T T G G A A T T GAAATATAA Majority 
.. . - 42 .'° 422Q -4230 . 4240 4250 

5w I A J: ^ - l G A T A A A G T G G T A A T T T T A T G TTT A. T C TT TGG A.ATTG A AATATA A cohl ai2.seq 
4201 T A CCATGATAAAGTGGTAATTTTAtBtTTATCTT.TGGAATTGAAATA T A A a909lai2.seq 

CCC.ATATTACCCGTTTTAT CGATAGCCAGTTGTGAATTATAAtCCAAACG Majority 
_. 4260 427Q , 4280. 4290 * 4300 

4251 CCCATATTACCCGTTTTATCGATAGC-CAGTTGTGAATTATAATCCAAACG cohl ai2.seq 
4251 -CCCATATTACC CGTTTTATC GATAGCCAGTTGTGAATTATAATCCAAACG a909_ai2:seq 

CTCTT-CCTTAGTCAT GT GC CACTTCATTCCTCAAGTTTTAAATTGCTTAT Majority 
, 4310 4320 . 4330 4340 4350 

430} CTCTTG CTCATGTGCC ACTTC ATTCCTCAAGTTTTAAATTGCTTAT cohl ai2 i seq 

4301 C T C T T G C tH A G T C A T G T G C C A C T T C A T T C C T C A A G T T. T T A A A T T G C T T A T a909lal2,sS 




TAT ATTCTTTGG CTCGGTT A AT H TTTTTTT ATACTCCTTTTC ATCC AT A Majority 

4360 .. 4370 , 4380 4390 4400 

1? 5i .? A T A T t GT T.T G G G TCGCT. T:A A.T A A T T T TTT TAT AEJT CCTtT.TC ATCCAT A "cohl_al2^seq- 
4351 T A T A T TC TT TIG G CT C GG T.T A A X A A T" TT.T-t T T.A TAG TCGTTT.tC AT CCA t. A a909._al2.seq 

f G C CTT.A CC C G C T CT T GGT A: A T.C CAT A A T C G € T C G A G A T T G G T G A A A T. G A Majority 

. - 4410 . : . ,. 442 P 4430 . 4440 * l 4450 

.4401 TC.C GTTA C.G C G GTCTT GGT A ATCC A T A ATC GGTC G A G ATTG G T G A A AT G A eohl_al2.seq 
4401 TCCGTTACGC G-G T C T.T G.GTA ATCC.ATA ATC G CT CGAGATTGGTG A- A A T G A a909_al2.seq 

ATTCCAATAA TT AGGAA-CTG A AG G ATAAGCC ATTAACCCTACC C CCA-CT'G Majority 

• 4460 4470 4480. " 4490 4500 

ATT' C* C A A T A A T T A G. C A A G*T GAAGGATAAG CCA TTAAGC CT AC C C CC AC T G cohl al2 sea 
4451 ATTCC A ATA ATTAG.CA AGTG AAGG ATA AGC CATTAAGCC T A CCCCCACTG a909_al2!seq. 

C A A T T A T A G T G A C A AGCA A AATCC AT ACT A A ATCTT GTCTT ATTTTTT.TC Maloritv 

\ 4510 4520 4530 4540 * 4550 

4501 CAATTATAGT.GACAAGCAAAATGGATACTAAATGTTGTCTTATTTTTTTC cohl_al2 sea 
4S01 CAATTATAGTGACAAGCAAAATGGATACTAAATGTTGTCTTATTTTTTTC a909_ai2.seq 



FIGURE 21F 




" A ' TAT pg^SSll ^ : i A /t* atcttattaacaac rr rr ^t^- 

45 . 60 4570 458Q ~~ X7~ 711 



« — j— 

- - , . 4 y° 4590 4600 

AACCACGACCAGCAACAACCACCATACCTGCTrr T ATTACTAAAAtTGCA Majority 

. J 46 - t0 4640 ~^so 

CCTATAATGTAGAAAATTGTTGTACCAATACCA C CTCTTCAAGGCAAnTf: Majority 
46 . 6 ° 46 7° 4680 4G90 4^00 

ACTACCTTTGt.TATTTTCAACAGTTOGGTTAAnTA A AAGGTTATCTG, u.. y 

— " 7 . t0 47 . 20 ' 4730 4740 ! r ~^50 

i:i:s^;?;;{s;?;ui;?j??«is«js?;jjiish;ij?;s}ji she: 

TACTCCTATCACTCCC TCCAT CT CCTA AAATAACnTTCTCACAGTTATry Ma jority 
22? 47 7° 4780 4790 4 ^Q 

AACAAATTGTAACC. TAA, GGGAGC C'T TTTTCTCA A CTAG-ATA GTATGTA.CC. Majority 

: i lf2 «30_ 4840 4850 

A A C * * AT TGTAACCTAACGGACCCTTTTTCTCAACTAr AT« r t i t r t « o ?• .„ 

TTCTTTC A AGCCTGTA ATGGTAATT A TACCATCTCCTCCTCTTGTATATT Majority 

• " 48 ?° 48 7° 4880 , 4890 .. 4^0 

T TCTTTCAACCCTCTAATGGTAATTATACCA TrTrrTfr4-rTT/-f.^.^X. 

TTe«T^A P ccT«, A *T««TA^;u»"Si;j;Sc;gs;s?;s;i?iu.s5K^^ 






TGTTCCATTACCTTCTGTGCCCCATTCAACGTTA T TTGTATCOTT.AAr. parity 
_ 49 . 1 ° ;«» ' 4930 4940 . 4950 

TTTAGAAATTGACCCGTAGCATTCTTTAAAAr. A AATATAGCACCTTCTAA Majority 
. 49 . 6 ° 49 ™ 4980 4990 5000 " 

TTTACAlA?Tr A~ r r £ r t * ^ A ? T C T T T A A A.A C A A A T A T A 6 C A C C T T C T A A 

AAATT G ACCCGTA G. C ATTCTTTAAAACAAATATAG-CACCTTGTAA agoglaill^ 

TGAAGCTTTTGTGCAACOATCAATTTT T T-TTATAr.TAATTICACCA.rrnp Majority 

SSL ■ S< *?° 5040 SOSO 

I li AG CilTTt 111 A A r ^ * *.I ^.T. T T T TT A T A G T A * T T T .G A C .C A T C C C .ohLaizW 

T fr A .A CCII.T T G T.CGAACC A T C A A T TTT T TT. TATA G TAA T T TO A C CAT 6 C C ^slalZ.seq 

^ ~ *j=£3Ui T T A C'-T T. r.T T G A C C T. G ~G C T C A T.C A T T G.ir. T -A C T AT T G G G C T T C:A T G Majority 

' . ?>g° .- ,, . sgro.. ■ ... s6>o , " : so'ot ~~~ 5^00 ; 

T C A C T &TT AC Tf TT T G A CCT6C G'T C A T C A T T C£ TAGTATTrrrrTTTA t r ' : « '.i 
TCAfTCTTACTTTT^tAG-CTGCOTCATCA T.T OCT ACIAT T G G.G'G- TT CATC a9M~alz's^ 
■G T C4»'C AATGTTTC IA T.T T T C'T GC-TAAATCAG C 'T CAACCTCC T-T.T A GC T C -C Majority 
. ■ 5l . 10 ■ Sl ?° , . 51,30 5140 . ' sTso 

■ ■ ACTCTTTAATACTCCTCTATAAGTGAC-TCT GATT'GTATTTATTCCC.TTAT Majority 

St - 60 5170 ■ , S180 S190 5200 

5151 A C T C,T TTAATACTC CTGTATAAGTGACTCTGATTGTATTTATTrrrTTAT ^u, ,.-> 
5151 A ^^CTTTAATA-CTGCTGTATAAGT-GACTGTG^T^G^A^T^A^T^CC^TAT a909Zal2]seq 



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tSS^5^,°WQ, 2006/078 3 18 " ™*^t Weighted jesidoe weigNtabfe, '^^^ ^£cT/US2005/027239, 

"~ T7 A A A-Gy A I f jglSl A C T A T T T C C G G T T j C A FTTTT^ T^: ~ 






y^iQ 

A " " ! " A " ^^Miai fj ^ffiP j£fl! AGFA,TTTr.r fi r.TTUAGTA T~T7^ ^T 

52 . 10 ^2 S230 5240 ^O 

AAAAAAAGTCATCATTAGCTCCATTTTGAGTATTTCCCCTTrr ArTATT^ . , 10 
AAAAAAAGTCATCATTAGCTCCATTTTGAGTATTTCCGGTTGCACTATTG a9Q9Zal2[seq 

CTAGCTGCCCACGCAATAGTAATCGTGAAAT T ATTATTTTCCTCTAACAr. Majority 

. f^2. f£E S280 , 5 290 5300 

GTAGCTGCCCACCGAATAGTAATCGTC A-A A T T-A TTA TTTTCCTCTA A C A r fnhi 
eTAGCTCCCCACCGAATAGTAATCGTGAAATTATTA??"cCTC^ACAG SStjSl^ 

GTTATACTTCCCAGTTCCTTTTTCCGAACCT T GAGTTACACTtGTAATAT Majority 

53 .'° S330 S340 S350 

eiTAIACTTCCCAilTtCTTTTTCCOAACCIIGACTf AtifTTfTiiTii t , ,„ 
GTTATACTTCCCAGTTGCTTTTTCCGAACCTTGAGTTAGAGT T G TAAT A-T a909~a!2]seq 

. TCCCTGATCCATCACTAATACTTACTTCATAA C ATCCTTCCTTCAAATCA Ma ,„-, f „ 

" ■ 53 . 60 f2» 5380 S39Q 5400 

5351 •""TCATCCATCACTAATACTTACTTCATAACA HcCTTCCTTI'lllTrl ... ,, 
5351 TCCCTGATC C A TCAGTAATAGTTACTTCATAAG A^ CCtTCCTTCAAATCA a^09^al^*seq 

ACTACAGAAGCAGATGCCATAGTATCCT T TATAACATATTGATACACTTT Majority 

fj^ S4 , 2 Q 5430 5440 5450 

ACTACAGAAGCAGATGCCATAGTATCCTTtATAACATATTf KTA>irTTi u« ,o 
AXTACAGAAGC.AGATGGCATAGTATCCTTTATAA'CATATTGATACACTT T a909Z2ii2.]seq 
T T C T G-T A C CAT GATAATTGACTGC A. T TCTT AT A AGTAATACTAT AT T'T G A Majority 

■ " 54 . 6Q . S470 .5480 . S490 ' 7s00 

T T C T G T A C C A T C A T A A T T G A C T C C A T T C T T A t A A G T A A T A C T A T A T T T C A ~ai ~ 
T T C T G T A'-C CATGA jT A. A TTGACTGCATTC TT A T A A -G T'A A TA GTATATTTGA a^&Zaij^seq 

CT G T AT C A C C A A C C G A G T A C C T TT T T TG A T C T A C A G T T T T T C C A C C. A r. r. * Majority 

S5 .'° , 5S . 20 SS30 ' 5540 • 5550 

CTGTATCACC'AA.CCCACTACCTTTTTTGATCTAC A C IT T T T C C A r r A rrl hi .•> 

Sm .CTGTATCACCAACCGAGTACG IT TTTTGATCTACAGTTTTTCCAC C.A CCA a909Ljal2~seq 

TCTCCCCATCTCGCATCACTATTCTTTTCAT G AATAGTAGCATTTCCAGT Majority 
SS70 SS8Q . 5S9Q 5600 

TCTCCCCATGTCGCATCAGTATTCTTTTCATCAATAGTAGCATTTrrirT • .-> 

TCTCC.CCATGTCGCATCAGTATTCTTTXCATGAAfAG?AGCA^ 

TACAGATGTAACCATAATTACAG'CTCCATTAT T AACAGTGCTAGAAA.CAT Majority 
. 56 , t0 J> 6 t 2Q 5G3Q 5640 5650 

T^a^I GTAACCATA A T W A CAGCTGCATTATTAACAGTGCTAGAAACAT cohl *12 sea 
T. AC AGATGTAACC A.TAATT AC AGCTC'CA TTATTAA CAGTGCTAGAAA'CAT a909Za£2^seq 

A A T A A f A T C C A T A T T G G G A A A C A T T A A T A AC C T C A G T A C C A T C A T T A T T T Majority 
^ 56 ?° . . ^7° ; ■ 5680 5690 " 5700 

A AT A ATA T C C A T A T.T GGG.AA AG AT T A A T A.A C C TC A'GTKC ATCAT'f A T TT a909_ai2.seq 
G A C, T C A G-T A AC' AC T G OA A A C T G GT C T A C T A Jt A C C T 'G, At AT Afi.A-TTT A G C Majority 
, 5?10 . " 5720 . ,5730 , S?4 0 ' 5750 

* 701 GACtC KGt A A C AG T G G A A A C T C G T G-T- A G T A T T A G C T C A T A f A C AT T T A r f ™m \>xo -~ 
5701 G A C T C'A G-T A A'CAGTGG A' A AC TGGT GT ACTATTAGCTGA'TATAfeA? TT A'l C a90^Zal2!s^| 

C'C ATGTCGCA AT C T C A TTT-G CTGA CCCA GTATCTTTTT T AC.TTAC AT A T G Majority 

' ' - S ?° 57 ?° . 5780 5790 5800 

5751 CCATGTCGGAATCTdATTTfiCTGACGCAGTATCTTT TTTA G-T TACATATf* ™hi ~t<> — 
5751 CCATGT C-GX A A'TCTCATTTG CTG AC-G C A GT ATCTTTT TT A G/TT AC AT AT G •a^09iZal2^s^ "• 

TTCTCCCTCC ATTACTAGTTGTCGTA A A A AG A G A ATT A A AATC AGTTG A A Majority 

- ff20 5830 5840 5850 

5801 TTCTCCCTCCAtTACTACTTCTCGTAAAAAGACAAT TAAAATCAfiTTr.'l' rAhi 
5801 TTCTCCCTCC ATTACTAGTTGTCGTA A A AAgIgAa" At A A ^C^ 



WO 2006/078318 PCT/US2005/027239

Alignment Report, method with Weighted residue weight table.

G C T T p rA - T T j pSagjj^jjcii GAA TTAAATAAG.AAGCTCCATC Malorlr. ' 

5860 5870 5880 5890 5900

CCTTTATArTrirrTT^^^ ^ cohl.ai2.seq 

5851 GCTTTATACTCAGCTTCTTTACCTTGAGGAATTAAATAAGAAGCTCCATC a909__ai2_seq 

TTTATTCGAATCAGATAC A TTTGCATTATCTATTTCTGCATCAAAAACTT Majority 
f 9 ™ S920 5930 5940 5950 

&*) ^'I I{ ^^ ^^^^^^CCATTATCTATTTCTGCATCAAAAACTt oohl ai2 seq 

5901 TTTATTCCAATCAGATACATTTGCATTATCTATTTCTCCATCAAAAACTT aSOsIaU.se^ 

TGTATGCTTTATAGGTTCC GCCTTTTTGAGTATCTTGAACTGTAATTGTC Majority 
59 ?° ; 5970 5980 5990 " 6000 

SH! WE^SSI" ATAC «*TGCGCCTTTTTG ACTATCTTG AACTGTAATTGTC.coM ai2.seq 
S9SI TGTATGCTTTATAGGTTGCGCCTTTTTGAGTATCTTGAACTGTAATTGTC a90 9^12. seq 

CCTGTCTCAG.CGGCAAAAGC TATCGGCGTAAGT C'C TCATACACCCATACC . Maloritv 

60*0 . £020 6030 6040 6050 

6001 CCTGTCTCAGCGGCAAAAGCTATCGGCCTAACTGGTGATACAGCCATACC cofal a!2 sea 
6001 CCTGTCTCAGCGGCAAAAGCTATCGGCGTAACTGGTGATACAGCC ATACC SS^ii^ 

AAATGCTAAACTCGCCAC TAACAGCGATTOAATCATTTTCTTTTTCATTG Majority 
gOgg • 60/0 6080 6090 6100 

AAATCCTAAACTCGCCACTAAGAGGGATTGAATCATTTTCTTTTTCATTG cohi ai2 seq 
A A A T GCTAAACTCGCC ACTA AC A.CC G A T.T.G AATCATTTTCTTTTTCATTG a90gl]al2.seq 

■ AAATCTTTCTCCTAAAATCATATTG'ATGAATGATTAATTCATA'TTTTTTT Majority 

. 6t t 10 . 6 \ 2 ° 6130 . 6140 6L50 

fJOI A A A T C T T T C T C CTAA.A ATCATAT.TGATGA ATCATTA ATTC ATA.TTTTTTT cohl_ai2.seq 
6101 A A ATCTTTCTCCTA A A ATC.ATATTG ATGA ATG ATT A ATTCATA-TTTTTTT aSOoi^s^ 

T C C A T A G T A T A A T ATT AATC C T G A T CG T A G A C CT A A A G C TA AACC A A CT.A Majority 

. . e \ m 6170 6180 , 6190 - 6200- 

til] I - S! T A G T A A A. T A T T A A T C CTG ATGGTAGAGC TA AA GC.TAAAC C A A C T A cohl^i2.seq 
6*51 TCG ATAGT.AT A A T A T T A A T C C T G A T G G T A G A GC T A A A GCTAA ACCAA.CT A aSOg^s^ 

GGAfATAAATGTGTGTTCC AATACCTCCAGTACTAGGCAATTCTGTTCCT Majority 
62 . 10 622Q 6230 6240 " 6250 

t™\ ^ATATAAATGTGTGTTCCAATACCTCCAGTACTAGGCAATTCTGTTCCT cohi ai2.seq 
6201 G Q A TATA A AT GTGTGTTCCAATACCTCCAGTACTAGCCAATTCTGTTCCT a909lai2.seq 

TTACTGTTAGTAATTTTAA A A G T A T A T A C T G T A C T T C C A T C T. AX T A A ATT Majority 

62 t 6 ° 62 70 6280 6290 6300 

6251 ~~ " ~~ " 

6251 TTACTGTTAGTAATTTTAAAAGTATATACTGTACTTCCATCTACTAAATT a909_ai2.seq

,C T C T T T T A T T G G T G T C G C A T T A T T A C C A T T T T G T T C A A A G G T A A C T C C C G Majority. 

• : S?} 0 . . 6320 6330 6340 6350 

*£I CTCTTTtAT.TCGT.GTC GG ATT ATT A C £ A T t T T ^ T f € A A A G G T A A C T C C C G cohI^i2.seq: 
.6301 .CT CT T T T AT T G GT G T C G G A T T A T T A C C A T T T t G; T T C A A A G G f A ACT C C C G a9 09^12. seq 

T AG-AAA T C A.C T A AT A C T. G A T ATA TIC A T TT T T AG C. J. ACTA G C TA CC.CT GC A Maloritv 

. • . -;6360 .'" ' ' 6380 . 6390 6400 

■•£}■ J * $ A A A A C T A A T A C T G A^jT A T A T C A TT T T T A G GT A G T A G G T A G C C^T G G A cohL_ai2.seq 
6351 T A G A A. A T .C A C T A A T A C T G AB A T A T C A T T T T T A G G T A G. T A G G T A C C C T G G A a909lai2 .seq . 

6GGCCCTTT.GTCTCTCTTA GG-TAGTAT TT T CCT AC t G G C A A A C -,T G A G G T A Majority . 

64 f to . . . 642Q " 6430 6440 6450 

™> l GCCTTTGTCT. CTGTTA GC'T AGTATTTTCCTACTGGCA AACT G A G G T A cohi al2.seq 

6401 G G G G C C T T T G ,T C T CTG TT AGGTAGT A T.T T T C CTACTGGC AAACTCACGTA a909^I2.seq 

G TTA T. TA. C CA.TC C. ACTA ATA ACAAGCCTT'T AT CGTTTGTCA CCA GC CCTG Majority 

' 6 * 6Q * . <X™ 6480 6490 6500 

6451 GTTATTAGCA TCCACTAATAACAAGCCTTTATCGTTTGTCACCAGCCCTG cohi ai2-seo 
6451 G T-T A-TTAGCATCCACTAATAACAAGCCTTTATCGTTTGTCACCAGCCCTG a909Zai2.seq 



FIGURE 21I






d:feATtAGCA TCTGATTCATAAATA Majority 
6510 6520 6530 6540 6550

- A TACATA GGATGTGAAGCTTTATTCC CATTAGCATCTGATTrATAAA T l „ IO 

TCAAAAACTGCACCTGCTAAAAAATTATTATCAT T TTCGACATTAACTTT Majority 
6560 6570 6580 6590 6600



CTGTAGTCGTACTTTTTG CTTGAT AC G TGTATTGGTAAAGCTAATATrT. MaJorlty 
ggg 6620 6630 6640 






CC.TCTCCTGAAACTGTCAGCGATTfiTA' ^ GCCGGTACCATC-ATAACTTTTA. «a Jorlty 
6660 6670 6680 6690 ^00 






TCACCTTCACCAGTTCC T'A CATTTTT TTCTGTAA TTGAC T'C ACATACTTT «,,^,^ 
: 67 . 10 S7 . 2Q CT30 6740 1 6^0 

TCAGCTTCACCAGTTGCTACATTTTTTTCTCTAATTGACTCAGATACTTT Tx^ll ^ 
A A A TT C A T C C T A G G C TT G T TC A f C T A T T C A T A T A G A A G TTC C A T a « r. r. t a Hajorlty 
CT - 60 6770 . 6780 6790 . ~ 6800 

C T T T A~'A A-TTGCTTAGT CTCAC CA TC TC TCACCCCi A'AATTCTCT T-c t't g f* )u Joclty 
6810 6820 , 6830 6840 : 68^0 

AACCTTrCACTTGGATT AAAC AA G AAGTCTTTCGTCTTATCTTC ATCTA G Majority 
P 8 . 60 - • ■ 6870 6880 6^ e ^00 

6851 AACGTTTCACTTGGATTAAA cohl a{2 .seo 

AACGT. TTCACTTGGATTAAACAAGAAGTCTTTCGTCTTATCTTCATCTAG a909lai2.s2j 

TCCAACGACAGTTTTACTTACTCTGACGGTGTAT T CTTTAGGTTGCCAAA Majority 
69 . 1Q 6920 6930 6940 






T.CC AACGAC'A.CTTTTACTTACTCTGACGG TCT ATTCTTTA TTTTT r r < a V 
: *CC*ACCACAGTT»TAeTtAiTCT«*CcS;S?A"S???ltSS??S^iil SSS^53 
CAGCATATAAGCTATTTGTTCCA T'C A C C C T T G T T A T C A A T A C C T A T T C A T Afa 
. 6 ^°- 69 . TO , ... 6980 " • " : ^ 

T G A C € T- G C T G T A.A A T T C C A C -A C C T C C t G T A T C A G n.T A A A T C C f T'A T C. A T -ii u^„ T 
7010 7020 7030 7040 7050



T C A C G T C C T G T A A A T t C C AC AC C T C C T C T A TC A CCTA A A T'C CT T A T r A tt ' m U 
TOACCTCCTGTAAATKCACAGGTCCTGTAKUCmAU 

ATGCCAACC A A t AAGCT, T G T A , A CCTGTC GTTCT A 'A AGTATf C G'TTTTC AG Majority 

™ G ° • 7070 ' ' 7080 7090 " 7l 00 

AT CCCAA.CCAATAACGTTCTAACC,TCTCCTTGT AAAGTATTrfTTTTrAf u, ,o 

a.tgccaaccaata.aggttgtaacctctcc?tgtaa1gtattgg?t^ 

GAATTCTACTTGTGCTATTCAACTCCATAGr.r c cTGTCTnTACTTC.TCTT Majority 

' n . M 7120 7 ?0 . 7140 7^0 

GAATTGTA'GTTGTGCTATTCAACTCCATACGCG GTGTCT cH ACTTGTCTT r^ht ,■> 
CAATTCr*r. TTCTCcti -r-rr* i « t* i- f r- . ~ . „ _ _ AVI A G i G T T cohl_al2.seq 

i» a a t AA»A AGTTGTGCTATTCAACT-CCATACGCGGTGT. CTCTACTTGTGTT a909_a!2.seq

7160 7170 7180 7190 7200

ACCACATTACCATTTTCTACTCTAGTACCACCGTT ACCATTCTATTTrAT ™hi ■ -> 
ACCACATTACCATTTTCTACrCTACTACCACCC^ACC^^^A^jGA^t^q 






TGAGGTATCTTCTAATTTGATATCTCCTA C TGCAATAATCACAGGTTTTA Majority 

^ ^220 7230 7240 7250 

nGAGGTATCTTCTAATTTGATATCTCCTACTGGAATA ATGArArrTTTXA ^Hi 
TGAGGTATCTTeTAATTTGATATCTCCTACTGGAATAATGACAGGTTT T A a909~a!2!seq 

TCGTGATATTTTTATTAGCATCTCCTAAATCC GCGTCAATATCAATGGAA Uajorlty 

Igggj - Jf™ 7280 7290 7300 

TGCTGATATTTTTATTAGCATCTGCTAAATGGCCGTCAATATCAATCGAA cohl a!2 
T C G T G A T A T T T T T A'T T A G C A T C T G C T A *A A T G -G G C G T C A A T A T C A A T G G A A ^909Zal2?seq 



TCATATGGGTTATAAATTT TACCAT TGTACCACCAGCCACGGAAACGATA Majority
. 73 /° 7320 7330 7340 ^ 0 

tTaTaTc^ cohl_ai2_seq 
TCATATGGGTTATAAATTTTACCATTGTACCACCAGCCACGCAAACGATA aDOoZaia.seq 

GCCATCTCCCATT GT.ee G A CGTCTCAGTAACGCTGAAT GC G AGCCATCOT Majority 

_ 73 *° • 7370 7380 7390 7400 

7351 rppiTflfr^ ATT C-T C G C A C G T CT C A C T A A C G C T G A A T G C C A G C C A T C C T cohl a!2 seq 



7370 7380 7390 7400 

-7351 OCCA. X CTCCCA^ 

CAT AG G- A ATCACGCTCA GTAGGCTC AT T A C TTGTTTCTA ATIGCTGACCA Majority 
7410 7420 7430 ~ 7440 """" ' 7450 






7451 G 
7451 G A A G C A 




C At'xgSa^t'ca'c^ 

C A T A G G A A T C A CGGTC AGTAGGCTCATTACT.TGTTTGTAATTGCTGA CCA a909_al2.seq 
G A A G C A T C C A A T G C T G G C T T T C C A T C T G T A C C A A C A G C A T C AT TGCTC.T A Mainrlr v 

74 . 60 747& 7480, , 7490 , 7500 

A A -O C A t C C-A. ATGCTGCCTTTCGATCIG T A CCAACA GCAT CATTGCTGTA cohl aiE sea ■ 
AAGCAT CC AATCCT8GCm.CC AT CTGTACCAACAGCATCATTGCTGTA Sdtal^ 

, T AT A A T A T C A T A A TCT CC ACCCTTTCC C CA AATA-GCTCTT AA ATTCATAT Majority 

' 75 . 2 ° 7530 7540 7550- 

ITf™^^^ cohl._ai2.seq 
T A T A A T A T .G A T A A T C T C C A G C C T T T C G C C A A A T A G C T C T T A A A T T G- A T A T 3909^12^ 

CTTG AGTTAC AGC ACCTGAA A A CTTATA AGGTCTAATACTACCATCTGC A Hajority 
■ 75 . 60 , JSTO 7580 7590 7600 

7551 CTTGAGTTACAGCACCTG^AAAAGTTATAAGGTCTAATACTACCATCTGGA a909_ai2.seq 

T T A A C A T A A T A C C A A C C G A C T A A T T T G T A A G C G T C T T T T A C G T A C T T G T A Majority 
. 7610 , , 76 . 2 <> . . ■ ,7630 : ; 7640 . . 7650 

5!?! Yl •^j tAA T A C.CAA.CC€ACTAATTTCTAAGCGTCTtTTACCTACTTGTA cohl ai2 sea 
7601 T T A A CAT A AT AC C^AC^ Ad T- A AT T T C.T A A C C G t^T Tit AC CT AC f A^^l^ 

g T!T A G T-XCT T G T AT C A A C AT TTCA GAGA- C T -AG' T A T C T G T C GTATAATA^ Ma,.Vr«^ 
: t . , . 7660 , , t 7670 , ; 7680 : 7690 . 7700 

lfs\ CTT A G T IS? T r t £ T r M ^ l?V I S M * 6 A " A G T A G ^ T A T A ' A T A G G cohl.ai2.seq" 

7651 CTTAGTTG T T G T A T C A AC ATT T-G A G A G A C T A G X A T C T C T C G T A T A AT AG.G a909_at2.seq 

. • CATCTTT.AGTTG AG T C G G G A T C T T T A T C T C G T G A A T C AUCTTAtAATAA Majority 
• 7710 7730 774Q - 7750 

CAT C T TT AG i Trk '^"Trr r r*'^ ^^^^^ A ^'^^. ^^T^^^^^ATACTTATAA T;A A cohl_ai2.seq 
C A.T CTTTAGTTGAGTC QG C'A T C T T T A T C T C G T.G A A T CAT ACTTATA-ATAA a909_ai2.seq 

J , A TG-TACC T C A AG C.ATCTTGCATATAAT C CCTTGTAATATCTGTATAATC Majority ' 
7760 7780 7790 7800 

^I^tITc^^ cohl_ai2.seq 
TATGTACCTGAAGGATCTTGGATATAATCCCTTGTAATATCTGTATAATC a909_ai2,seq 






FIGURE 21K 



— « c a a mm *** ^m»TrATT i G T A T c A T c T c T T T T T . 

7810 7820 7830 7840 7850



* T ,**TTGCCCTCCCTTTC C ATCAATATTGACAP.r.A T ATCTTACrTTTTr.r MajQrlty 
??" 78 , 7 ° ™ 7890 ""^oo 

CAACCTCCATAGACTTTA ACATC A TG AGGAGCCATAG.TCCTftTTAAACT.r HaJorlty 

AAATACTTGTGTTTGTGCTTC GTC T TT ATACCATTTACCATCCCAAArAT u*j orlty 

"j 60 ™° 7980 7990 "l^o 

AAATACTTGTGTTTGTGCTTGGTCTTTAT ACC AT T TA CCATrrrA 4 , f ,t ... 
. A A A T A'C T T G T G T TK TCCTTGCTCTTTATACC'ATTTAC.CATCCCAAACAT a909Zal2~se<j 

ACCCTGGTCGACTACGTTTAGGTTGA A C C C T T r. T c C T A T C C C C C r. r. A T A A «a Jortty 

W . 10 m ? a , 8030 8040 ~~ 

ACC.CTCGT.CCACTACCTTTAGGTTGAAcHcT TGTCGTATCGCCfrrATAl •.. 
ACCCTGGTCG A -C •TAGGTTTA.GGTTGAA CX -G.TT C ICG TATCCCepCCAI.il aM9Zal2!seq 

CACGA,CAAATTTf G C T C A T A T A G A A C A T C C T T T A r. T C C A A A A T T A C r. A AC u*J m liy 

23gg *!"> ■ «°. 8 ° [ 80^0 .. 8100 

CAGCACAAATTTTGCTCATATACAACATCCTTTAC TGCAAAATTAVrt^J- ., 
CAGGACAAATTT T G CTCATATA GA ACAT.CCTTTA C.T G G A A A- A TT A G G A A G aMsZal^iseq 

CTCTGTATTATCAACCC CATC TA A A T A T T T A A T C T T G T A T C A A T T A rcrr toJorlCy 
8110 - .sup :813 0 8140 " ' '~7^0 

C T C^T G -T A T T A T C A A G C g'a ~hl_ai2.se, 
ciun,.lATTA.TCAAGC.GCATCTAAA.TATTTAATCTTGTATGAATTACGTT a909_al2. S eq 

CAT.ACCATACCACTAAGTTCAAATAATCTTTGT C CTAGTCTQCATATTTA Majority 
8160 «™ 8»80 • 8190 1 8^00 






•«. CA-CCATACCACTAACT B c2a*^ 

' TCCTAGTA.TTCATCT CCGATT CCCA. CT TTTCTTTTTGCACTCGTTTr.TrT u,,^,^ 

-8220 : 823 <r . 8240 So 

8201 T CPT A CT A TT C A T C.T cU J A jB C cfl A C 1 I I aIBItMUt GrATTrrTTT rHr t k. .o 
820. T CO T AB T A T T p A T cQ G C G l|pc'cCA C__Bt T^^B | C^ItbTtTtItS 3S__S_5 

„TGCGTTCTCATCAAATAC CTAA TTA TC TGGATAT.AACCTTTCATAr.TATT Majority 
- • • _ — •__ ■ "S 70 . I'" "«?» ■ 8290. SoO 

S T C^B f t\hH_| A T C AAA? aHg T A ATT A fff J r A T A |_* A iJ c O T ? G A T *Bl A.-T.T .cohl_al2.soq- 

. «« <-U I >.HWU A X C A A -A T AflC TAAIMTC.TfifiAl aQ ,A AG C T ; T T G A TAG T A T T a909_al2._eq - 

T A A CJA T TA-AlX f C iC TA^G CJ A TT T TTti '.T.G T.A-" AAC C t A A.ATTCGTCT CC T C C A Ba.nr,^ 

C C A C C T C C C C C T C T G T C T C C T A A A G A G f A -T T T c C C A T C T A C T C C T T G T T T Ma . n,, ■ 
" .8360^ gyp gagp . 3390 . woo 

0 A C fTflf. cfSSfr C T t&l A A aS a^ t ^ ° C ° A T C ^mc C T T-G Tit cohl.alE.sea 
«f.C A C C TC C CC C TIOG T C T CHT A A AQ A G T A T TIC CCAT4JTACTCCTTCTTT a909.jal2.seq 



CTAGA.ACGGATAAT,TTTCAATTCTCTTCCr T TTTGGATAGAGTTTTATTT Majority 
84 . 10 8 y° ' . 8430 . 8450 



B? A G A AB^A T A A T T T T G A S T WtI^ 5 ^JSA ° ° ^ A i A G T FTHTrlrt! co h l_ al2 . seq 
• W 1 A G A AHGHATAATTT T G A A T T CQ C tDH C d^PBc GAT aQaHtTTTATT T a909_al2.seq 



FIGURE 21L 




°" c T« j^ ^ 5j »g B g | S Q " ^'c"% IE cggta'ctatga'Actcacccaa a jfa^jg 

"I 6 " ^J 0 8490 ^OO 

S« C A r?^^^ 7 ^' 1 ' 1111 """ - tB ° ''-I. A PP A TGAACTCACCCAAA coM a.Z.seq 

K4ol C A Tinl G G A T T T G C Tiafl AH n T A c T r r aWB t r r « ^ ^ , ^W* * . , ^ . . . _ — 1 



GGATTTGC TDD aQ C T A C T C C a|| T C G g|J A G T A tQ:A ACTCACCCAAA a909_ai2.seq 
TAACTC ATTCC TTAT CTTCCACTTTGGTTATTTCC AACATTGGTTACGTA Majority 
«5I0 8520 8530 8540 8550 






ACG-CCATCC| GCTCT CTTCCATTCATACCCATTCGCC GCTA ACCTTCTAC Majority 

- ; . gggg . «?0 . . ' 8580 8590 8600 

22 * r?^^W?^^? GTCTTC ^ A ^» TG ATAHCCATtBgCGGcAa A CGTTGTAC cohl al2.seq 
8536 A C G C C aQ G CACCTCTCTTCC aJB T G ATAGCCATTCG cQc C T A AQ G tQg TAG dXHCrtzI^} 

• CCTATAGTC CTGTGTA GGTTTC.CCCATCTGATCCTCTAGTTCTAGCAAfC Majority 

86 . lQ 863Q 8640 8650 

ffo! g« T *BACTCCTGTCTAiaGTiTcBGC ATcHg Ag G C T cB a|| T j!|t aH G A A tA cohl a!2 sea 
8586 CBT A T A G.T C CB G T&T A G G T T- T C G G C A xtf?.C AT G Cfl C T A CTtfli G G A fife So^alts^ 

CTACTATTTTCCTAATGAATC TCCGAGTAGCCCTTTTTTGCAAATTTTAT Malorltv 

_^ 6Q _ 86 /Q 8680 8690 . 8700 

8644 G JL A ^£ A JL T n T G CPA AHC A AjJH TTc~G^Bi|MirC~cEf tH T T T T G C A A A T T t T A t ™hi ^ 
.8636 GpA 0D AB T T T Gffr A A TjQ A Atfc T cH G^G^KC C (^HmH T tQ G C A A A Tfl tB AT SKS^ 

T G T G A T C A G- T TTTCTATC AT A AT AAA CATTAACGAC ACTTC AACCATCGT Majority 

87 . 10 8720 ,8730 8740 . ." 8750 

mtr Im^Kmtt r GTT ^CTATC.ATAATAAACATTAA g||a C A C T T G A A C C A T C G T cohi^ai2.sea 
B G tDaDDa GTtBtCTAT CAT A A T A A A C A T T A A C G A C A C T T G A A C C A T C G T. a909Zal2:^ 

C T T TT A T C A T G A C A G A A G T T T C T C T C C T C G T A T T A T T A A C T T T A A A G C C A Majority 
•■ 8760 87TO ,8780 8790 8800 • 

£ ZZ ZZ tZ ^ A I 5 A $ A C A A G T * T C *• G T C C * C * T A T T A TTAACTTTAAAGCcl cohl a!2.seq 
CTTTTATCA.TGACACAAGTTT C.T.GT-C C T CCTAtT ATTAACTTTAAAGCCA aSa^zlSj 

C T C C G T A G T T T T T C A T T A A T A T C T f G T T C T G T T A G C CTCTC ATTAG ATAA Malorltv 

8810 8820 • ' 8830 ' ; 8840 ~88S0 

8789 CTC.GGTAGTTTBtcaTTAATATCT.TG TTGTG TT AGCGTCTG ATTAG AT A A cohl ai2 sea 

8786 g.t c ggQ aQ t t t tH cattaatatcttgttg. tgttagcgtctgattagataa Sm£zZ% 

ACATACGCCTCATCGTGTT ACTTG C CC TGCGT.ACTCATATGTCTTTTCCC Majority . 

- , 8860 . ,8870 8880 8890 8900 

8839 AGA JAGGCCTGATCG.TGTtACTTGCCCTGCGTACTCATATGT cohl.al2.seq 
8836 A G A T A G G C C T G A TC G T C T T A C T T G C CC T G C G T A C T C A T A T G T C T T TTCCG a9093ai2.^ 

C A T C A G T/A. G C A T T T T T A T T A TCC GTTGCTC'ATTGTTCCC ACT AGTTTATC Majority 

- ■ ; ■ ■ *y° , ... 8920 8930 8940 8950- 

j?S t A Z T£l Z * G H t T T T 5 T A T: ? AtCC G T T G C f.C A TT G TT:G.C C A GT A GITTATC ^hUai2.Seq 
8886 C A T C A 6 T. A CCA T T T T T A T t A : T C C ;G T T G G T G A f t G T t G G C A G T A G T -T t A T 6 ^OS^l^ 

,G T G T A CjG T T G T T t GTG CCC G C G A C C A A T G T G C AT AT A G C ^ T C GT AT C C T * Majority 
• ' . ' ' • 89 < 6Q ..' ^ 8970. " 8980- ' " * • 8990 9000 

^ ZZZZ A ^IISI TTCTCCGG ^ CeACC AATGTGCATATACUfcGTATCGTi cohl2.ai2.^q * 
8936 G T G T A G G T T G.T T T G T G C G G G C G A C C A A T G T G C A T A T A G C G T CGTATCGTT a909lal2.seq 

G G T C A A G A C T T G A T T A A A A T C A A A 0 G C T G T C C C A C C A C T A G C A C C T G T G T Majority 
9010 .9020 • , ■ 9030 9040 ^ 9050 

22 G A A ? A « ^ ^ T G A T T A A A A T C A A AG G C FcjfrcCC ACCACTAGC AGCTGTGt cohl ai2.seo. 

8986 GGTCA AGACTTG ATTAAA ATCAAAGGCTGTCCCACC ACTAGCAGCTGTGT a909lal2.seq 

A C C A C C C T G C A A A AG T AT AACCTGCCCTTCTTGG ATCATTAG.GCTTAATT Majority 
9070 9080 ' 9090 . 9100 

A.CCACCCTGCAAAAGTATAACCTGGCCTHGTTGGATCATTAGGCTTAATT cohl.ai2.seq 
ACCACCCTGCAAAAGTATAACCTGGCCTTGTTGGATCATTAGGCTTAATT a909lai2.seq 






FIGURE 21M 






jGT_CC A.^qg yfi;T[J gjpiaiq;; ^giffi' CGAGGTGCAATATAGGTAAC Majority 

9110 9120 9130 9140 9150 






GTCGAAGCAGGT T G Cffc TGTT A A C A C AC C A CG AG G TGC A A T A T AG CT A A C cohl_ai2 seq 
G T C G A A G.C AG GTTGGICTGTTAACACACGACGAGGTG CA ATATAGGTAAC a909_ai2!seq 

TCCTGTTCATAAGTCGCC TCTCTTGAATTCAACACCGTAC.GATTCTTTA'A Majority 
9*60 9170 9180 9190 " 9200 

TCCTGTTGATAAGTCGCCTGTCTTGAATTCAACACCGTACGATTCTTTAA cohl a!2.seq 
TCCTGTTGATAAGTCGCC. TGTGTTGAATTCAA-CACCGTACGATTCTTTAA a909__ai2.seq 

ACACAGGATAA ACACT T.ATCT C f G A AG AT A C ACC T A A T&CTTC A A TT TC T Majority 

^ 9210 9220 9230 9240 9250 

9189 A G AC AGG AT AA.AG AC T*T ATGTCTG A ACATA CACCTAATCCTTCAATtTC T- cohl_ai2 seq 
9186 AGACACGATAAAGACTTAT GT.CT G A A GATAC AGGTA ATGCTTC AATTTCT a909„al2.seq 

GATTCAGAAA.G TGGAGCTCCATTTTCAGTTTTAGACCAACCTACAAATAA Majority 
• 92, 60 9270 9280 9290 9300 



9239 GATTCAGAAAGTGGAGCTCCATTTTGAGTTTTAGACCAACCTACAAATA A cohl_al2.seq 
9236 GATTCAGAAAGTGUGCT.CCATTTTGAGTTTTAGACCAACCTACAAATAA a909_al2;seq 

TAATGTTCAAGT GCGTGCTGAAACTTTAAATCAACTTAGATCCCCGGTTC Majority 
9310 9320 - . 9330 9340 9350 

9289 H * A TG T T G AAGTG GG T G G T G A A AC T T T A A A T G A A C TT AffAT C CGCGG TTC cohl_al2 seq 
TAA.TGTTGAAGTGGGTGGTGAAACTTTAAATGAACTTAGATCCGCGGTTC a909_al2.seq 

CCTTTCCTCA A CTA TC C A A T G G T A CTTGATAACGCTCC A AAATACTTTTA Majority 



9360 9370 9380 9390 9400



9339 CGTTTCC T G.A.A CT A TCCA A T G G TACTT.GATAACGCTCC A AAATACTTTTA cohl_ai2.seq 
9336 C G T'-T.T CCTGAACTATCC A AT GGTAC.TTCATAACGCTCC A AAATACTTTTA a909_ai2.seq 

CCAGCAGAA T C A tC AT ATA U CTT ACTG TCC C C A CTTT ACTC AT AAATGG Majority 

. 9410 . 9420 , . 9430 ... ; 9440 9450 

9389 C C A G C A G A A TC AT C AT A T A A ACT T ACT.GT C G C C A C T T T A C T C A T A A A T G G cohl_ai2.seq 
9386 CCAGCAGAA TCATCATATAA A GTTACTGTCGCCACTTTACTCATAAATGG a909_ai2.seq 

A CCA AC A T A A A T T TC T TTT.CT C tC AGtT AC.AGTT ATTGGCTC ACC A A ATT Majority 

• - 9460 9470 948Q 9490 9500 . 

94; *9 A C G A A C AQ A AATTTCTTTTGTCTCAGTTACAGTTATTGGC T CA CCAAATT cohl_ai2 seq 
9436 ACGAA:CATAAATTTCTTTTGTCTCAGTTACAGTTATTGGCTCACCAAATT a909_ai2.seq 

TAACAGGGTCA CCATACTTTC.CAGTAGTAGGATCATAGGTATACCAACCA Majority 
9510 , . 9S20. ^530 9540 9550 

TAACACGGTCACCATACTTTCCAGTAGTAGGATCATAGGTATACCAACCA cohl_ai2.seq 
T A A C A G G G T C A C C A T A C T T T C C A G T A G T A G G A T C A T A G G T A T A C C A A C C A a909_al2.seq 

T T A A A ATCCTC TCCTGC TT T A ATC CTCC G A ATC C C A A CT TCTCCf AC A C A Majority 

9^560 9570 /. . " 9580- 9590 9600 

9539 TTAVA A A T G C T C T C C T G CTTT A A T. C-GT C G G A A T C C G A A CTT C T C CT AG AG A txjlil>12 :seq 
T T A A A ATGC.'T/C T C C ; TG C T t T AAT C 6.T C Q G A AT C C C A A CTT C T C C T AG A G A ado^ai 2 .seq 

T TCTC CA TiCTT TTAT A ATTTG A T G A t G A A C T T G C A T A C:C T G AA GGTCTG A Malorltv 



9610 9620 9630 9640 9650



9589 :T T PTC C A T C T T T T A T A ATtt» AtGATG A AGT'TG C A T AC C T G AACG.TGTGA coftt_ai2-seq 
9586 TTC TCCA T-C-t TTT AT A A' T T T G A T'G ATG A ACT T G C. A T A C C T G A A C C T G T C A a?09„ai2.seq 

C G A A A T T A T A A T C A G TTC CGTC' A T'TAT'TTTGAA A A T G G T A "A GTTA ACCT A Majority 

■ 9660 .9670 '. . , 9 6SQ - - ' 9690. . - 9700 

9639 C G A A A Tfl A T A A T C A G T T C C G T C A TT A T T T T G A A A A T G G T A A G T, T.A ACCT A cohi_al2.seq 
9636 G G A A A T T A T A A T C A G T f C C G T C A T T A T T T T G A A A A T G G T A A G T T A A C C T A a909_ai2. seq 

GCAAC TTCTG. TA T T.A TCCTCTTC A ACAATTCCAT A AATCGACA ATGA ATC Majority 
. 9710 9720 9730 " 9740 9750 * 

9689 GGAACTTCTGTATTATCCTCTTGAACAATTGCATAAATGCAGAATGAATC cohl_ai2.seq 
9686 GGAACTTCTGTATTATCCTCTTGAACAATTGCATAAATGGAGAATGAATC a9Q9_al2.seq 



FIGURE 21N






WO 2006/078318 PCT/US2005/027239

Alignment Report, method with Weighted residue weight table.

Thursday, July 29, 2004 6:49 PM

T C T T TP'4I1 1X^23 jjjffiSi gcEl SSj|| TTCTTACTTTCTGCAGTATCTT Majority . 

97 < 6Q 9770 9780 9790 9800 

9739 TCTTTTAAAAGCAACATCACTGCTAGTGTTCTTAGTT T'C TGCAGTATCTT cohl ai2.seq 
9736 TGTTTTAAAAGCAACATCACTGCTAGTGTTCTTAGTTTCTGCAGTATCTT a909lal2.ieq 

TAGATTTTAATACTTCTCTTTGACCATCATCTTTAAAGTGAACAACTTTA Majority 
98*0 982Q 9830 9840 9850 

TAGATTTTAATACTTCTGTTTGACCATCATCTTTAAAGTCAACAACTTTA cohI_.ai2.seq 
TAGATTTTAATACTTCTGTTTGACCATCATCTTTAAAGTGAACAACTTTA a909_ai2.seq 

AGCTT TTCATCTGAAGCTTCTAATCGCTTATCATAGTTGACCTCTACTTT Majority 
•/ " 9860 9870 9880 • 9890 9900 

»839 AGGTTTTCATCTGAAGCTTCTAATGGCTTATCATAGT tQA CCTCTACTTT cohl ai2 seq 
9836 AGGTTTT GAT CTGAAGCTTCTAATGCCTTATCATAGTTGACCTCTACT T T a909_al2.seq 

•T A C T C G G G C.T T G G C G T T C T G C T- T C T T T A C C A T T T G A C T C A, ATAGTAATGT Majority 
' , 99 /Q 9920 9930 9940 9950 

TACTGGBgCTTGGGGTTCTGCTTCTTTACCATTTGACTCAATAGTAATGT cohl_ai2.seq 
T A C T G G G G.C TTGGG.GTTCTC CTTCTTTACCATTTGACTC AATAGTAATGT a909_.ai2.seq 

CATACAGTTTGA AGTTTT T G'A TTTCACTATCTtGTTTACCAACTTCTGTC Majority 

9960 3970. 9980 9990 10000 

9939 CATAGAGTTTGA Afflt TTTTG ATTTCA. CT ATCTTGTTTAGC AA cQ T C T G T C cohl.lal2.seq 
9936 C A T A G A G T T T G A A G .T T T T T G A TTTCACTATCTTG TTTAfC AA.CTTCTG T C a909_al2.seq 

AATGCTTTTTTCTTATACTCTTT AAA .A GTACCTGA ATTCTCTTtTA ATTC Majority 

10010 10020 10O3O 10040 10050 

9989 A A -T G CT T"T TTTCTTAT AH TCTTTAAAAGTAGCTGAATTGTCTTTTAATTC cohl_ai2.seq 
9986 A A T G C T T T T TT C T T A T A G T C T T T A A A A G T A G C T G A A T T G T CT T T T A A T T C a909_ai2.seq 

C C T C A C C TTTA AATCAGCATTTTTAGGAATCTTAGCT.TCTT.TCGTCAAAG Majority 
, " t0060 10070 10080 ' 10090 " 10100 

10 °3 9 CCTCA'CCTTTAAATCAGCATTTITAGGAATCTTAGCTTCTtTGCTCAAAG cohl_ai2.seq 
10036 CC TCA-CCTTTAAATCAGCATTTT'TAGGAATCTTAG.CTTC. TTTGGTCAAAG a909_al2.seq 

T-CACTGTTACA G T- A TAGTCTCC ACC T.C TAAACATCAATCCTTCTTC ACGG Maforltv 
jlOjlO 1Q120. . 1013O • 10140 10150 

10089 TCACtGTT.ACAGTATAGTCTGCACCfCTAAACATC.AATGGTTCTTCAC G_J| cohl_ai2.seq 
10086 TCACT'GTTACAGTATAGTCTGCACCTCTAAACATCAATCGTTCTTCACGG a909_al2.seq 

TAACC AGCTTCCTCAGAAGATGATGTTTCTGTTACACTAGAAGCAGGAGT Maforitv 
• toj60 10170 10180 " 10190 10200 

10139 TAAGCAGCTTC.CTCAGAAGATGATGTTTCtGTTA'CA G.T AGAAGCAGGAGT cohl__ai2 seq 
10136 TAACCAGCTTCCTCAGAAG A-T GATGTTTCT. GT-TAC AC TAG AAGCAGGAGT a909_ai2-seq 

C T C T G G C T T G C T C T G C T C A A C, A C T T G A T T.G A G A- A C T A C A T G T T G. A TGAAG Maforitv 

10210 1O220 10230 ' 10240 10250 




10189 C X CO G G C X T G X T € T G-C T.C .A A C A C TtG, A T T G A G A A C T A G A T GTt.G A T G A A G eohl_/U2.seq 
" > *°" " " " " ~" ^ A T T-G A.'G A AC t 'A'G A T.G'f TGAT.G A A G '.a9G9__ai2 Jseq 



i0l86 CT G T.G trCTTCCTC T.G C T C A;A C A C T T G A T T G AG A AC T A G A T.G t T G A T G A AG .a909__ai2 lieq 
T TA.CC.TG GJCT AG A A TTT.TT A T T T* T CT A A AG T-A.A TCC CCA.C A T ,c" A T C T G T 6 * Maloritv 

. . 1026Q • 10270 - 10^80 /. * 10290 * 10300 

10239 TTACCTGG CT AGAATTTTTATT T.TCT AAA G T A A T C CC CACATCATCT CT C cohl_.ai2.seq 
10236 TTACCTGGCTAGAATTTTT.ATTTTCTAAAGTAATCCeCACATCATCTGTC a909_.al2.seq 

Tt'A G TTTCT.TCA A C T G TT ATTGCTGGTAG A. A T X A A A. A AA T A A G T C G T T A A Maforltv 

. . J . . 10310 . 40320 10330 10340 10350 

10289 T T A G f TT C TT C A A C T.G T T AiT T G C T G G t A G A A T f A A A A A A T A A G t C G T T A A cohl_.ai2.seq 
10286 T T A G T T T C T T C A A CT G T T A T T G C T G G T A G A A T T A A A A A A T A AGTC G T T A A a909_ai2.se <r 

A A A A C=T T C.TT A G G A T C A T C A A T G A C C A C A T G A T A A T T T T C C A CT C T.T T A G • Majority 

10360 , 10370 . 10380 . 1Q390 10400 

10339 AAAAGTTGTTAGGATCATCAATGACCACATGATAATTTTCCACTCTTTAG cohl_ai2.seq 
10336 AAAAGTTGTTAGGATCATCAATGACCACATGATAATTTTCCACTCTTTAG a909_ai2.seq 



FIGURE 21O




o i t c aa »-cyib iuza g a^i gn? nTn„. m ^c»fiTCT T T gj^ : — 

10 ? 10 «O420 ,0430 104 40 ~ l0 450 

JO? 89 G G TCTTTTTTCTTTTTTAATCAT TCGATTATAAAACTTTGAr«rTTr T TT - .., 
10386 GGTCTTTTTTCTTTTTTA ATTATTrr iTTirii? cobl_a!2.seq 

1 rTTAATGA TTCGATTATAAAAGTTTGACACTTCTTT a909_aI2.seq 

A C C A T T T T T C C A T C C. T C C C T A A C C T T A A T T G A T ACTACTAATCTTACCTA » a ,^„ r 
i !°if°. 10470 MM8Q 10^90 1OS0O 

G A C C C C A TAT T C T G A A A C A A A T T T T A C C T A C A ATTTCTTC TTCTGAAACA Majority 
lOflO ,10520 ^0530 ,0540 ~ iolso 

10489 G A C G C C A TATTCTCAAAGAAATTTTACCTAC AATTTfT-TrT-rr-rr. . . > ... . 

10485 GAGGCC'ATATTCTCAAAGAAAT.TTTACCTACAA TT TtlTCtTCTtAAICA aM9Zal2!seq 

TCTCCTACAGAAGTAtTTCGAGAATCAATTGA A CTTTTTCr.GTTGTCTCr: Majority 

. loseo 1QS7Q 1058O losgo ^ 

TAATACAAA AATTTTTTTATCAGGTACTTCATA T CCCTATTTTATATfAC Majority 
10610 10620 1063Q 10640 ~~ ~IZso 

T A T T A C C G A G T C C T T T A T C A A T A A C A T A T G G T T C T T T C A A C. T T A T G T T C A Majority 
. 10 f 6 ° , 10670 106 80 ^o" ~~ ~7oToO 

TTCACGT.AAACAtC.CCCTTG.AfiAATCAATAT T AACCCACTCTC.CTCACTC Majority 
10710 10720 10730 10740 10750

TC C A ATA * C C C G C T T G A C T A G G A C C T T A T T A T T C T A G T A A A A C C C C, A r. A A Majority
I 1076Q 10770 i0780 " ,0790 ~^00 

CGTCTCCAGTTTTAAAATTTGAACCTTTTACTGT A AA-GACTACATCACCT Majority 
'0820 , 10830 " . l08 40 ""iolso 

G C A C TI-A A A G T C T T A T T C A T T G A A T G T C C G T A G A T T C T T A A T A C A C C C A A Majority 
. . , .10960 io&TQ. 10880 10890 '. 1^00. 

S t c'ac T tVaAC ^cnAi^l HfM t £ tTF*^ A G « ATA.CAG G C A A oohl_al2-.s6q- 

«««5 IMC X. T.A A A C T C T T.A T T.C A T T G A A T O T C G G T A GAT T C T T A. A TAG A) G G'C A A d969^ai2;2q 

CCA t A A A A C C CCA A T T.A A A AT CCCT.C Q G A fi G C A A CM C C.A T.C A A G A TO T Majokty 

,10?10 10920 ■ . . 1Q 93Q : . ,0*40 10950 

10889 CC AT A A A ACCC C A ATT A A' A ATCCCTH tTC AGCC A ACCGC C ATr A A c A T r V u. ,■> 
1088S CCATAAA'A Pr«ir>* a tt« • . i _ » _ „ * „ •'""•'"'••H'l'i.lAllAACAICT cohl_al2.seq . 
1U8BS C C A T A A A A C C G C A AT T A A A A TC GC TG T T G A GC C A A C C 6 C C A t C A A C A T G T a909_a!2.s«j 

. AT.ATGCTATTTTTAATOACACTCC A A A A G C G T T T C T C A T A A G T T A C G C G A. Majority . 

' 1Q?70 1Q980, .10990 ^ISoO 

10 ^ 39 A T AT G G T A.-T TT T -T A.-A'T GACACTCCAAA A-G C GTTTC TGATA A TTT-A r r c r a 
.10935 ATA TC G T A T TT tt. , . , , . * 1 - 1 t. A yA-lAA GT-TAC GCGA cohI_ai2.seq 

loaja A T G G T A T T T T T A A T G A C A C T C C A A A.A G C G T.I T C T G AT A A G T T A C C G G A a909_ai2.seq 

TCCACTTCTTC-AG A.T A A T 'T T A T C T.C AACTA A T (i T GT CTTTTC ATTGTCTA Majority 

"020 11030 .11040 ' . UOSO 

I098 ? T'CCAGTTC.TTG AC ATA AT T T.A TCTGAA CTAA T^C TGTCTTTTCATTGTCTA ^hi 
10985 TCCAGTTCTTCAGATAATTTATCTG*ACTAA?ctG?c????ci??G?c?i 



FIGURE 21P




c c c c T ; £i r'C'oil|r}^;0 Eg ESSI ctttkta aata(;gt A tctgca : - 



11060 11070 11080 11090 11100



^ C C C T C T C TTAT CTACTAAATTCTTTACGTTTTCTAAAtAGGTATCTGCA cohi ai2 seq 
11035 CGCCTCTCTTATGTACTAAATTCTTTACGTTTTCTAAATAGGTATCTGCA Sota^^ 

C C . -T AACTCAGCAGTCTCAA AAATACCACTAAGCATTAAGCAAGCTTCCCC Majority 

U ! 10 Ht 20 H130 11140 iTiso 

^^IMS T S A ^ CAGTCTCAA ^ AATACCACT ^ AGCATTAAG GAAGCTTCCGC cohl ai2.seq 
11085 GCTAACTGAGCACTCTCAAAAATACCACTAAGCATTAAGGAAGCTTCGGC aSOsIai^^ 

AATAG AACCACCTT.TCC T A ATTTTGATTTTTTT, ATCATCTAGCGCTTCTT ttatorltv 

t1 ; 60 »™ 11180 11190 11200 

A A Ti^r^^^^ A.GCTTTGC T A A T TT T G A T,T T T T T T A T C A T C T A G CGCTTCtT cohl ai2.seq 
11135 AAT A CAACCACCTTTGCTAATTTTGATTTTTTT ATCATCTAGCGCTTCTT 3909^12^ 

TAACTTGCTGAATCTCTTT CTCTTGTTTTTCAATAAGAAGTTCCTGTTCT Majority 
' l1 ? 10 "220 11230 U24Q U2SO 

11185 TAAGTTGCTGAATCTCTTTCTCTTGTTTTTCAATAAGAAG TT GCTGTTCT a^Ia^s^ 

AACATAATTTCTACCAACTC TTTTCTTTTTA ATT TTTTTAAATCTTCCAT Malorlrv 
11260 11270 .11280 . U290 7l30Q 






1!^ "^^TAATTTCTAGCAACTCTTTTCTTTTTAATTTTTTTAAATCTT CcTf cohl_ai2 seq 
11235 AACATAATTTCT A GCAAGTCTTTTCTTTTTAATT TTTTTAAATCTTCCAT aS^il'Sc] 

■ CUGATTACTTCCTTAACT. GAACCTTAAATTATCGTTTAGATATTATATC Majority 
!1?1° H320 - , 11330 11340 . ~ 11350 

Ii289 CGCGATTACTT.CCTTAACTGAACCT.TAAATTATCGTTTAGATATTATATC cohl ai2 sed 
11285 CGCG AT.TACTTCCTTAAC T G AAC CTTAA ATT A TC GTTTAG ATATT ATA T C SSjS.I^ 

AAA G T T C T A A C C T T T A A A C T C A TT T f T T G T C C T C T G T T T TT T C T C A A A A A Majority 

ll? 60 11370 11380 : ,' 11390 * iUOO 

11339 A A A G T T C T AAC.C TTTAA.ACTCAT.TTTTTGTCCTGTGTTTTTTCTCAA"AAA cohl al2 sea 
11335 AAAGTTCTAACCTTTAAACTCATTTTTTGTCCTGTGTTTTTTCTCAAAAA aSOTL^illlseq 

ACTCTATCCTA A'A TTAAC A TTTTTCATA ATT TT T TGAAAAATCTCATCGA Majority 

... U fl° 11.420 H430 . . 11440 11450 

* rTrl^I^^ AA *T A A" C AT.TTTTGATAAtVtTTTGAAAAATCTCATCGA cohl ai2.seq 
11385 AGTCTATGCTAAATTAACATTTTTGATAATTTTTTGAAAAATCTCATCGA asoalail^ 

AGtCATTTTCTT TTTGAAAGCTCGAATTCTACGCATTAAAAAGCCATATA Majority 
U46Q 11470 U480 U 490 ^ 1^500 

JJS AGrCATTTTCTTTTTCAAACCTCGiATTCTAC,GCATTAAU .A,fiCm.m cohl al2.seq 
11435 AGTCATTTTCTTTTTGAAACCTCGAATTCTAGGCATTAA A aH^BBBBKBB a90dlai2 .seq 

T C.A. A ATTGATATATGGCT T T T T T T A T T A T T T A A A A C A. A AAC AATCAATA G- Majority 
llflP ' ".. 11520 11530 11540 -- U550 



1 1489 VT^_A A A T X G A T~A" T A'T;'G-G C-T-T T.T TT .T'A T T-A. T.TT.T. A A A A C A A A A G- A A T C A A T A k ^ W _ai2.seq 
• U476 ' '^^^■—■^^■■■■■■■■i^^ AC A'A AAGAATC AATAG a9G9„al2.seq 

,G A C A A T A G C f f C A ATT T 'A '<* 'T-. QA -Q'.A :T A A T C KT A jT T.- A -C A C' A TI A. A G T T C I t T . T Majori ty 
\ * - • . U f^° ". ' ' I^TO ': - • tt&SO . '\ •". ' 1X590. ; • - . J1600 - J 

$ A C A A T A G C £ T C A A T T T A G T G A C AT A A T C TAT T A C A GAT T A A GT f C T T T T cdM^i2.seq 
11492 G A C A A T A G CRT C A A T TT A G T G A C aQ A At CTATT AC A G ATTAAG. TTCTTTT .agO^iS} 



T G A A T A A T A T A A T C C A A C T T T T C A A C T G T T T T T T C C C A T G T Q A A A T G T T C Majority 
J 1161Q .111620 , '11630 •- 11640 '■ U650 

I r A * I ^ * I - I A A T C C A A C T 5 T T C A A C T G T ^ T T T T C C GAT CT G A -A A T G T T C cohl- ai2.se*, 
US4a T G A A T A A T A f A A T C C A A C T T T T C A A C T G T T T T T T C C C ABC T G A A A T G T T C a909^i2^ 

T T T A A T T C T T T T A G C A A T A T T C T G T T G T AG TTTC TCT C TT AATGCCTTAT Maforitv 

U66Q 11670 11680 - H690 11700 

JJ639 TTTAATTCTTTTACCAATATTCTGTTGT ACT TTCTCTCTT AATGCCTTAT cohl al2:seq 
11592 TTTAATTCTTTTACCAATATTCTGTTGT AUTT TOT C TCT T AATGCCTTAT a909lai2.seq 



FIGURE 21Q 






u ? 10 "730 U740 ~^" so 

»s«»:sJii?iii?:ii:i:in?iJs:i:?:i;j:isiijjjj S «n-a5S5 

AT G ATGATTCCTAACTCA6GGCTATCAATAACTTr A ACTGTT C CArrr.rr. Hajo,,^ 
11760 11770 11780 11790 11800

ATCTCTTGCAAT A-A T A C C A C T.C G AAA C TAGACCAGCTTCTAAAATAGACtt Ma jorlty 
11810 11820 11830 11840 11850

T T G C T A A T C C C T C T C G A T A C A T T G A T G r. g T A A A C A A A r. A T A T C T G T C. T GT . Majority 
" 86 ° 11870 M8 8Q T^T -JT^ 

s^:?::;:uu:::i;:h;;::;:::;:iihi»:^*n;:;s?uss» 

GCCATTAAAGACA T-A GTCTGTTCA-A A C T T T A A T TTC CC-CAA AAA CTTA AT Majority 

11910 11920 11930 11940 11950

11889 6CCt "*'*CACAIACICIGTTCAAA«TTTA AIItCCCCAAAAArTTtir k. ., 
11842. CCXATTAAAGACATA6TCTGTTCAAACTTTAAtTTCC'C.CAAAAA"GTTAAr "a!^Zal2*seq 

CTGTTTCGACTCATAT TTCT CTT TCA AATCTGCtAATTCACGTCCCTCTG Majority 

11960 11970 11980 11990 12000

11939 C,T C TTTGCACTGATATTTCTCTTTCAAA fCTGCTAATTCArrTr^rTfTr ,, 
...893 CTGTTTG-GACTGATA TI ICTCITTC AA4I6I6CI AArIC VCOTCCC ICIC SogZalzTseq' 

C T C C * * T C T G T A A A T A A AG A T T T T C A G A G T A C. T C T G A C A T C C A A A A T G C T- Mainrlnr 

12010 12020 12030 12040 12050

11989 CTGCAATCTGTAAATAA A-C ATTITC-AGA GTACTGTCACATCfi A A A t r r t ... 

11942 - C TGOAA-TCT<pTA.AATAAAC AT.T T. TCA 6 AST ACTS T8 AC A T'C GA'AAATGCT a909Za!2.seq 

TCTAAGAGCAATTCAATGCCTTTTTCTTTAATA ATTCTArf.AcCATAAr.T Ka forlty - 

12060 12070 12080 12090 12100

.2039 T C'T AAGAGCAATTCAA-TGCCTTTTTCT T'T AATA ATTCTACf i r P ATA A r *r u, ., 
11992 TCTAAGAGCAATTCAATGGCTTTTTCTTTAATAATTCTACCAGCATAAGT a909_ai2.seq

GATGAAAATATCATCAGCACATTTTTC'AACfi lAAGCCCTGTC A.C C A A A .A T Majority 
12110 12120 12130 12140 12150

12089 CATGAAAA TA TC AT C A G C A G A T T T T T C A A. G C T A A C C C G T ft T r- A r r a a a at ... 

12042 CATGAAAA TAT. CATCA G'C AGATTTTTCAAGGTAA.GC C fe tBD CACCAAA.A T- aMg^aiisiseq 

C A G AG C CTAGACT. TTCAGATA.CCGAA TT A T A A A T A A C T C. C T t T A C C T T ft T- Majority . 
12160 12170 12180 12190 12200

. A'T ATTAAAATG TTT T A A C C A T TC A A C C C T T C T CT T G C A* A C-C G'C AT A A A. A Majority 
12210 12220 12230 12240 12250

S AT A T T A A A A T G T T T T A A r £ Mg It f* 2 P T ? C T *B C A T A C C C C A T A- A A A cohi.aizlsea 
12142 ATATTAAAATGTTTTAACCATTCAACGCTTCTCTTGGATACCGCATAAAA a909_ai2.seq

> ; • AT , C ,J ^ ACGK TACT G C T * AACXCCCCC T-G. T GAGA A- CATC T-'T CAT AG A T A fi Majority 

12260 12270 12280 12290 12300

12239 ATCTGGACGATAGTCCTTAACACGCG C'T G T G A G A AC A T G T T r A T A r- A T A J* rJ<hi ^„ 
12192 ATCTGGACGATAGTGCTTAACACGCGCTGTGAGAAGATGTTCATAGATAG a909_ai2.seq

. ' T C C A A A G A A A T C T A A A A A A C G A T t A T T G A C A G A A A A A TC A C T T6A CC C. A Majority 
12310 12320 12330 12340 12350

12242 CTCCAAAGAAATCTAAAAAACGATTATTGACAGAAAAATGACTTGACCCA a909_ai2.seq
FIGURE 21R






T G G T C 



r Y ;| G CTTC TTTCCAAAACATACCCC Majority 



« t l2 ?Sq 12390 12400 

12292 TGCTCTAAAACAATACTACGTAAACCGTGCTTCTTTCCAAAAGATACCCC a909_ai2.seq

TTCTAGCGTTGTTAACTGAAAACCTGTATTACAAATCACAAAATCAATAT Majority
12410 12420 12430 12440 12450

TTCTAGCGTTGTTAAr^ -hi ai2.se* 

12342 TTCTAGCGTTGTTAAGTGAAAACGTGTATTACAAATCACAAAATCAATAT a909_ai2.seq

TTTCATCTCAAACATCTTTCATCAGCCTCTTGTATTCTCCATTTTTGTTA Majority

12460 12470 12480 12490 12500

12439 TTTCATCTGAAACATCTTTCATCACCGTGTTCTATTCTCCATTTTTCTTA coh1_ai2.seq
12392 TTTCATCTGAAACATGTTTCATCAGCGTGTTGTATTCTCGATTTTTGTTA a909_ai2.seq

ATAATAGGATAGCGCTGCTTGACAATGTTTTTGCTCGCTAAACGGTAAAT Majority

12510 12520 12530 12540 12550

12489 AT A A T A G G.A T A G C G C TGC T T G A C A AtBtT TTTCGT CGCf A A Af f f T A A at ^ m 

12 *^ ataataggatagcgctgcttgacaatTt^ 

TTTTCTACCCTTGTCTTCATCTAT AATCCCTAAATCATCATGArTACTTC MaWl^ 

12560 12570 12580 12590 12600

12S39 TTTTCTACCCTTGTCTTCATCTATAATCGCTAA ATCATCATCATTArTT^ M ^ 
12492 TTTTCTACCCTTGTCTTCATCTATAATCGGTAAATC aQ C ATGATTAGTTG a909^al^!seq 

TTACAATAACAACACGGTAGCCACGCTTAACCAAATCTGCTGTCATTTTA Majority
12610 12620 12630 12640 12650

SS ""££"^1^ cohl_al2: Seq 
12542 TTACAATAACAACACGGTAGCCACGCTTAACCAAATCTGCTGTCATTTTA a909_ai2.seq

TCTGTATAACGTTCAATACCTCCGAGGAAGCGTAGATAATATCCTGAGAA Majority
12660 12670 12680 12690 12700

S£c?G?^ cohi.ai2.sec 
12592 TCTGTATAACGTTCAATACCTCCGAGGAAGGGTAGATAATATCCTGAGAA a909_ai2.seq

AACAGC AAC.TG TTTJYAC CTTATTTT CCA TATTTATC CACITTC AICAAT Majority 

12710 12720 12730 12740 12750

12689 AACAGC AACT.GTTTTTACCTT-ATTTTCCATATTTATC F A TTTTf A Tr A ,o 
12642 AACAGCAACTGTTTTTACCTTATTTTCCAT A T T T A T CCApTTTCATCAAT a909Zai2]seq 

AAGCCATCTTTTAAGCCTTTAATCATAGCAACTAATTTTTTGCTCTTTTG Majority
12760 12770 12780 12790 12800

12692 A A G C C ATCTTTTA A.G-C OTIT AATCAT ACC A ACT A ATTTTT IGCTC TTTTC a909_ai2 .seq 
CTCTTCTCCTACCAACACTCGAACAAATTCATTTCGCATAAATACTAAAT Majority

12810 12820 12830 12840 12850

mfz c I c r t c let 1 1 ° r A f ^ G A A 0 A A A WAT T T c at a a at a ct-aaa! eohUai2.seq 

12742 C T- C TT G T G C T A C C A A C A C T C G A A C A. A A T TC A T T T% G.C A T A A A T A C I A k A T/igO^iia^ 
At t:t C* GCCGCtT CT. T C TT A C C A T A T T T T T t T A t- A" A T A T A A A T C G C A -T T G MalLl^ 

12860 12870 12880 12890 12900

I2 ^ 39 A T x T G T G.C C G C t T C T TX T T A C C A T A T T T T TT T A T A A T A t A A A t C C C A f T r «A1 a i9 ™ 
12792 ATTTGTGCCGCTTCTTCTTACCATATTT.TTTTA TA AT AT'A AATCG.fc A T.I .^O^Zaillseq 

CGTATCATGTAATATTTTCGAAATGGTGAATGATTCAATACATCAAAAAC Majority

12910 12920 12930 12940 12950

\f*£ JffliZ^^ cohUai2;seq 
12842 CGTATCATGTAATATTTTCGAAATGGTGAATGATTCAATACATGAAAAAC a909_ai2.seq



: A T G G C C A A A T T T f T T A X C T C G T C A A G A G T G T C C A A t T T C G T C T A A A A G A 6 Majority 
12960 12970 12980 12990 13000

I??* 9 !^ C ^^ C !!! TTTTTTUCKCTCAAGACT GTCCAATTTCGTGTAAAAGAC cohl al2.seq 
12892 ATGGCCAAATTTTTTAACTCGTGAAGAGTGTCCAATTTCGTCTAAAAGAC a909_ai2.seq
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> A A T A F^^FA'Wli EESg ll ggSBE c C C A T C T C T r. A r. A g I r g g FTT -J- ~ 

13010 13020 13030 13040 13050

TTCATTTCAG.ACTCAACAAAATCAATAAACAT C TCTTnTr.CAAACCCAr. A Majority 

TCTTTCTTCAAAAACGCTCCTTTTCATTAA A CCACCCr.AACTAATACArT Majority 
13110 13120 13130 13140 13150

CTTCAATTTCTTTAT ACTCA AATT C TTCCATCACTAAATCTTCACCGTTr. Majority 

13160 13170 13180 13190 13200

13139 C T T C AATTTCTTTATACfCAAATTCTTCCA tBa CTAAATCTTf If f rrrr ... .-> 
13092 CTTCAATTTCTTTATAGTCAAATTCTTGCATCACTAAATCTTCACCCTTC a909_ai2.seq

ATATCTTCATACAAACA AC AT A AC ATACCCACCT T ACCTAAATCAAGGTA Majority 
13210 13220 13230 13240 13250

ATTTTCATAATTA TCTATCA AA T CACCTACGACAACCCAATCTTGATCTA u.w,^ 
13260 "P 0 13280 ,3290 - 1330b 



AACTCAAGAACCAATCAAATTCTTCTGCTA C T-G-C A A 'A T T G A C C fi A T A r A ? Majority 
13310 13320 13330 13340 13350

TTCAAA. GCATATC.CAATTCCTTTATTTTCT G' T T A A ATAATC A ACACTTA (I Majority 

13360 13 37Q 13380 ' 13390 

A^f^^S AATTCCTTTATTTTCTGTTAAATAA TCAA.CAGTTAG cohiai2 sea 

ATATCCAATTCC T T T A T T T T C TC T T A A A T A A T'C A A C AGTTAG So£S^ 



GTGCCCCTCTTCATTA TAATCG GC CA CTAATTGAGAAATTTCTTCCTTAT Majority 
13410 13420 ' .o^n ZTT-^ 



_____ __ 13370 . 13380 13390 , 3 400 

13339 TTCAAAGCATA ~~ 
13292 TTCAAAGCATA 

CTTCATTATA 

13 f l ° * 342Q . 13430 13440 13450 

13389 CTGCCCCTCTTCATTATAATCGGCC ACTAATTGAG.AAATTTCTTrrTTA-r k, 
13342 *TGCCCCTCTTGATTATA;ATCGGCC^ 

TTTTCGAGCC ATT ATCTAC G ATGTA GAT ATGG CTTACTTG AGGATA AATT Majority 
13460 13470 134 80 . , 13490 13500 

m-m^-ni urn mm t ? si ; j ffi IHSfi sffsssss;. 



• j ' CC T ^ C A.A T C TT C T C A T C T:A A C UHC A A T X TtI-GGCTT A A Aft GT fa A OA A ». Majorlity 

13510 13520 13530 13540 13550

13489 G C T CC A A T CI TC-TClf CTAACCGT T C A A T A T T G C C G T T A A If CTr ki i . 4- ,o 
13442 G C T C G A A T GT .ICTCATQTAACCCITCAATATTCOCC XT AAA G'C'J'G AC AA T. d909ljal2!seq 

ACCCGCTA AATATTTCATG.TTCT ATCCTCTTT T .CTAAA ATCTCTA A ATA A HaJ6 f Hy . 

13560 13570 13580 13590 13600

13539 ACCCGCTA A AT AHT.C ATCf TCTAUCICi ritCIA A A ATf Tf f » 1 i rt 1 u. .o 
13492 ACCCCCTAAAT A'T T T C A T Ct T-CT'ATCCT C T T^TCTAAAAT CT C T A' A A-TA;A dJJo'^Ljallziseq 

■ QC t G A A T G A C T C G T G 0 T T T G C T T A T A A A A A C. G A T A C C G A C A T A G A T A ft T T- A Majority 
13610 13620 13630 13640 13650
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C T C C T)|A ^ >f y WjU H ggggS^C jgf| jgjjr ACCAATfi ATACTCACATTTCA Majority ' '' ' 

13660 13670 13680 13690 13700

S r^r^r^^^ TTGAAT ^ AC A T A A T T T * C C A A T G A T A C T C A C A T T T G A cohl_al2.seq 
13592 CTGCTACTAAACTTTGAATCACATAATTTACCAATGATACTCACATTTGA a909_ai2.seq

CTATTGATATAATAGAGTACAGCTCCACTAAGAGTAGCAGCAATTAAATA Majority
13710 13720 13730 13740 13750

g2 c?i???A? A i A *i yCAGCTCCACTAAGACTAGCAGCAATTA A A i A coWjdZ.aeq 

13642 CTATTGATATAATAGAGTACAGCTCCACTAAGAGTAGCAGCAATTAAATA a909_ai2.seq

GCCCAGCATTCCTCTTGTTAATTCTTTAAAAGTAAATACATCTCTTAAAG Majority
13760 13770 13780 13790 13800

13739 GCGCAGCATTCCTCTTGTTAATTCTTTAAAAGTAAATACATCTCTTAAAG coh1_ai2.seq
13692 CCGCAGCATTCCTCTTGTTAATTCTTTAAAAGTAAATACATCTCTTAAAG a909_ai2.seq

AGATAGCTTCATATAGGGACACAATAAATTCAGTAATAACTGTAGAGATA Majority

13810 13820 13830 13840 13850

13789 AGATAGCTTGATATAGGGAGACAATAAATTCAGTAATAACTGTAGAGATA coh1_ai2.seq

13742 AGATAGCTTGATATAGGGAGACAATAAATTCAGTAATAACTGTAGAGATA a909_ai2.seq

ATAGCTCC C A TAGC ACCTAAAATTGGTATTAAAAGTA T A TTAAGCACAAC Majority, 
- 13860 13870 13880 13890 13900 

J22 'iTi^Trro * ** cohl a!2.seq 

13792.ATAGCTCCC ATAGCACCT A A A ATTGGTATTAA A AGTAT ATTA AGCAC A AC aaOB^tzT^ 



seq 

ATTTCCCACAAGTCCAATAACTGCAGACATTGTGTAAGCTTTTGTACGTC Majority
13910 13920 13930 13940 13950

13889 ATTTCCCACAAGTCCAATAACTGCAGACATTGTGTAAGCTTTTGTACGTC coh1_ai2.seq
13842 ATTTGCCACAAGTCCAATAACTGCAGACATTGTGTAAGCTTTTGTACGTC a909_ai2.seq

TTGAAGCCAGTAGATACTGTGTCCCTAAAGCGTTACCATAAGAAATGCAA Majority

13960 13970 13980 13990 14000

13939 TTGAAGCCAGTAGATACTGTGTCCCTAAAGCGTTACCATAAGAAATGCAA coh1_ai2.seq
13892 TTGAACCCAGTAGATACTCTGTCCCTAAAGCGTTACCATAAGAAATGCAA a909_ai2.seq

ATGATCATCAAA . Majority

14010

13989 ATGATCATCAAA . coh1_ai2.seq

13942 ATGATCATCAAA . a909_ai2.seq

Decoration 'Decoration #1': Shade (with solid black) residues that differ from the Consensus.
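
The decoration rule stated in the legend above (shade residues that differ from the consensus) can be illustrated with a minimal Python sketch. It is not the program that produced the figures. The function names, the '#' marker standing in for solid black shading, the tie-breaking rule and the toy sequences are all assumptions made for illustration, and none of the sequences below are taken from the alignments.

```python
from collections import Counter


def majority_consensus(seqs):
    """Return the most common character in each column of an alignment.

    Assumption: ties are broken in favour of the character seen first in
    the column (the behaviour of Counter.most_common); the tool used for
    the figures may resolve ties differently.
    """
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("aligned sequences must all have the same length")
    return "".join(
        Counter(column).most_common(1)[0][0] for column in zip(*seqs)
    )


def mark_differences(seq, consensus, mark="#"):
    """Replace each residue that differs from the consensus with `mark`."""
    return "".join(mark if a != b else a for a, b in zip(seq, consensus))


# Hypothetical toy alignment, not taken from the figures.
toy = ["ACGTAC", "ACGAAC", "ACGTTC"]
consensus = majority_consensus(toy)
for row in toy:
    print(mark_differences(row, consensus))
print(consensus, "Majority")
```

With only two aligned sequences, as in the nucleotide alignment above, every mismatched column is a tie, so the choice of tie-breaking rule determines which row appears shaded.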
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MKLSKKLL jgl jETjl »j ggjj gH g gj MS T V RAAEVSOERPARTTV ^ 

10 20 30 40 50



M K L ITIlTT!TFyT ijij A «STVEPVAQFATCMSIVRAAEVSQERPAKTTV sag645_2603.pep 
K p lUl AGSTVEPVA< J FAT G"SIVRAAEVSQERPAKTTV ^sla^.^ 

^ J LF Si ACSTVEPVAQFATGHSIVRAAEVSQERPAKTTV saS^cj bll^ep 

uv^r£I",£f AA y, ^I MVAGSTVEPVA0 - FA TCMSIVRAAEV.SQERPAKTTV sag645 cohl.pep 
UKLSKKLLFSAAVLTMVACSTVEPVAQFATCMSIVRAAEVSQERPAKTTV sa^sIne^lG^ep 

NIYKLQADSYKSEITSNCC I ENKPGEVISMYAKLGDNVKCLQCVQFKRYK Majority - 

60 70 80 90 100

NIYKLQADSYKSEITSNCGIENKBGEVISNYAKLGDNVKGLQCVOFKRYK sac645 2603 oeo 
N I Y.KL? A DS YKSE I TSNGG I ENKDG EV ISNY A KLGDN VKGLQCVQFKR YK 2 S'S 

J^!! DS " SEITS »" IE »MCEVIS,.VAKICD»VKCL C . K T S^bllfpep 
^v5. L S ADSYKSEITSNCGrEN ' KDGEVrSNYAKL «'"'VKGLQCVQFKRYK saSs^h peT 
NIYKLQADSYKSEITSNGGIENKDGEVISHYAKLGBKVKGLQCVQFKRYK Z ^slSla i^p 

VKTDISVPEL KKLTTVE AADAKVCTILEEGYSLPOKTNAOCL, VVDALDSK Majority 
110 120 130 140 150

IrlS cIS^f tLTTVEAA ° AKyCTU - EE « VSLP( } |tT » A «GL¥VDALDs i sag645 2603.pep 
VKTn¥^vn'!fi'!r!r^TTvij A * " AK VGTILEEGVS'LPQKTNAQGLyVDALDSK sag64sZa909 . pep 
^^nTfvSnl^,n^ AADAKVGTILEEGVSLP « KTKA « GLVVDA LDSK ^S.cjblinep 
r^nIfI^, L ^. LTTVEAADAKVGTItKEGVSLP Q K ™ A< JCLVVOALDSK Sag645 cohl .pep 
VKT.DISVDELKKLTTVEAADAKVGTILEEGVSLPQKTHAQCI.VVDALDSK safeneJil^ep 

S N V R Y L Y V E D L K N S P S N I T K A Y A V P F V L E L P V AN S T G T C F L S B I N I Y P K H Majority 

160 170 180 190 200

51 SNVRYLYVEDLKNSPSHITKAYAVPFVI.E LP VANSTCTCF LSE I N 1 IP K N sae645 2603 oeo 
51 SNVRYLYVEDL K.N.S P S N I T K A Y A V P F V L E L P V A N S T C T G F I S E I H I Y P K N S^s"^'™ 
SNVRYLYVEDL KNSPSNITKAYA'VPF V.L ELP VA HSTGTCFLSEIB I Y P KM SSrclburL, 
SMVRYLYV. EDLKNSPSMI TK AYAVPFVLELP V'A H S T G T G F L S E 1 N I Y P K N 2« J.U pep 
SHVRYLYVE-DLKN S P SNITKA'YAV P F VLEIP V-A KSTCTCF L'S E I N I Y PK N ^S^ifpep 

VVTDEPKTD KD VKKLGQB:D ACYT I'CEEF K WFXK ST I P A N L C D Y E K F E I T D Majority 

210 220 230 240 250

VVTDEP. KTDKDVK.KLGQ.DDAGYT I GEBFKWFLKSTIPANLCDYEKFE I T D sae64S 2603 oeo 
VVTBEPKTDKDVKKLGQDDAGYTIGEEFK. WFLKST1PANLGDYEKFEITD S^Kol'peS 
VVTDEPKTDKDVKKLGQDDAGYTIGEEFKWFLKSTIPAHLGBYEKFEITD SrfWclM i f L> 
VVTDePKTDKDVKKLGQDDAGYTIGEEFKWFLKSTIPAHLGDYEKFEltD sa^S^h Pep 
VVTDEPKTDKDV'KKLGQDDAGYTIGEEFKWFLKSTIPAKLGDYE.KFEITD S^S^i^Pep 

K F A PC IT V K S VCK IK I GSKT LNRDEBYT IBEPTVBNONTLKITFKPEKFK Majority 

260 270 280 290 300

nKFADGLTYKSVCKIKlGSKTLMRDEHYTIDEPTVDNQMTLKITFKPEKFK sae645 2603 beo 
ilKFADCLTYKSVGXrKIGSKTLHRDEHYT.rDEPTVDHQNTL.KITFKPEKFK Z&rt J^^Z 
51 K F ADC LTYKSVGKIKICSKTLNRDEHYTIDEPTVDNQHTLKITFKPEKFK SS^clbiu Lp 
51 • K F ADG LT.YK.SVGK IK I CUT L N R D EHYT IDEPTVBNQNTLK I T F IP E K F K Sola *>h peT 
51 K F A D C J. T Y K S V G K I K I G S.K T L N R B E H Y T I B E P T V B H Q | ■ T L JC I T F K P. I K F=K SglsZSo^ep 

, E j A EL U C M,T L V K H Q 0 A,L BK AT AN T D D. A A F L E I PV AST ij E K A V X C R A I B Matoritv 
310 320 330 340 350

I J" l C H I L V K Q D A V !•' K A T A N T D B A A F LElg VAST I NEK A Vt G K A IE sag645 2663.pe P 

^ J * SI* I" ^ 55 I M K 2 Q B A L " K A T A N T D! * A A P L E I P V A S T I H E K A V.L C,K I JE sk§45 _ a9b9 pep 
"nT A !?i'^Ir ^ A ^^ A ^ A ^^^^ AAE kE4:PVASTINEKAV"LG'K A I E . sa^45 - cj bill pep 

E. I A E L L K G I T L V K N Q D A L .D K .A T A N T D D A A F L E I P V A S T I N E K A V L G K A I B saS^S ~cohl 
E I A E L U CUT LVKN.QDA. LDXATANTDDA API EIPVASTJNEKAVLGKAIE Sg'fcS^ie^ep. 

,N T F S L Q Y D H T P D K A D M P K P S N P P R K P E V H T C G K R F V K K B S T E T 0 T li G C A E Mktorltv 

360 370 380 390 400

"^fff^y^yY^y^Fiy 7 ^^^ *ag645_2603.pe P 
mId-cI'aw "PPRKPEVHTGCKRFVKKBSTETQTLG'GAE sai645 a909.oep 

^r^^ D ~ T ^ DKADNPKPSKPPRKPEV,tTGGKRFVKKDSTET< JTLGGAE aa^cj bl fpep 
h t ccf'^vniiTD P E ^ E ^ ^"' G KRFVKKDSTETQT L G G A E Sa |645~coKl.p;p P 

NTFE.LQYDHTPDKADNPKPSNPP.RKPEVHTCCKRFVKKDSTETQTLGCAE saieiSje^pepV 

FDLLASDG j ^Ej jggggl j^g HI I gj VTCOP r K" f K S K-T P G T F F. T K ^ ^T^ ' 

410 420 430 440 450



Bl FDLLASBGTAVOTD«l J^t!^^ TC,!PIICL1CSBTDCTFEn( 3ag64S_a909.p^ 

lOl FDLLASDGTAVKWTDAl. I.KANTNKNYIAGEAVTGQP I KLKSHTDCTFE IK sa^45Zns«i3t6?pep 

LAYAVDAWAEGTAVTYKLKETKAPECYVIPDK E IEFTVSOTSYNTKPTn Majority 

460 470 480 490 500

^ * YAV P AN AECTAVTYKLKETkAPECYVIPDKEIEFTV SOfSYHTKPTn ,«n, 

LAYAVDANAEGT: AVTYK'LKETKAPEGYVIPDKEIEFTVSOTSYMTKpTn f^f" J ^ Up<5P 
L4IiVD4lliBST|TTTKll[EIICAPECTVIPJIBlEFTTSQTSt N •? K P T D Sg645^eB316^ep 

ITVDSADATPDTI'KHMKRPSIPMTGGICTAIFVA T CAAYMAFAV.KGMKRR Majority 
510 520 530 540 550



G 



151 G 
151 G 
151 G 
51 G 
51 G 



T KPN 



Majority 



51 T K D N 

51 T K D N sag645_2603.pep 

51 TKDN sag645_a909,pep 

51- TKDH sag645u.cj bill. pep 

51 TKDN ■ - sag645_cohl .pep 

. sag645_nem3 16 . pep 

Decoration 'Decoration #1': Shade (with solid black) residues that differ from the Consensus.



10 20 30 40 50

S K K ROKIWRGLSVTl"i*i"f*Qn r»cr* JLVQCETQDTNQALGKVIVKKTGDNA *ag649^603.pep 

MKKBftKiwB^t'oTrT LSQ IPFG 1LVQGETQDTNQALGKVIVKKTGDNA sag649 cohl Zl 

llllll SrGLSV^J ^2 J ILVQGETQDTNQ A L G K V I V K K T G D N A ^ l^p 

MKKROKIWRrf^^VTi^r^T ^ o « In ILVQGETQDTNQALCKVIVKKTGDNA ^oIlSx^ 1 . 

„ IPFG ILVQGETQDTNQ'ALGKVI VKKTGDNA 3ag649_ c jbn 1 .pep 

TPLGKATFYIKM DNDK SET S H ETVEGSGEATFEK I KPCDYTIREETAP 1 C Majority 

60 70 80 90 100

T P L G KATFVLKNDNDKSETSHETVEGSGEATFEK 1KPG DYTLREETAP T r <^««40 o«no 

TPUKATFVLK.NDNDKSETSHETVEGSGEATFE KP T tJ r ^SU^S?* 1 "* 

~ TPLGKATF. VLKM DHDKS ETSHE TV EGSGEATFEN IKPGDYTLREEfAP Ifc SSkM if* 

51 TPLGKATF.VLKHDNDKSETSHETVEGSGEATF1N I XFGITTL1EBT AP I r Z&fruFS'*** 

51 TPLGKATFVLKNDNDKSETSHETVEGSGE TFE no U G tSS*^*** 

si t^lgkatfvlkndndksetshetvegsgeatfe^kpgS^lrI" 

YKKTbKTffKVK VADNG ATIIEGMDADKAEKRKEVLNAQYPKSAlYEPTKE Majority 
110 120 130 140 150

?"?!!£tI£vSv!S;?f? J ^"?^ KAE ^ RK HVLNAQYPKS A I YED.TKE ^49,2603. pep 
101 I KKTBXT.WKVKVAO NGAT I IEGKDA D K AERR'KEVLN AOYPKS A I Y P n T k k ^JZrjq 
101 YKKTDKTWKVKVADNGAT I -I EGKDADKAEKRKEV L N AOYPKSA I YEDTKF t3£ii W 
101 YKKTDKTWKVKVADNGAT I I EGMD ADKAE V AO Y K ! JedJkp S^o~ !SS ?PeP 

101 YKKTDKTWKVKVADNGAT I I E G ED A" D K AE K R K E V L k A Q YP K S A I YE'DTKE sa^49^^bllKpep 

NYPLVNVEGSKVGEQYKALNP IHCKDGRREIAEGWLSKKITCVHPLPKHK Majority 
160 170 180 190 200

151 NYPLVHVEG.SKVGEQYKALNPINGKDGRREIAEGWLSKKITGVNDLDKNK <=^4Q ^n<* ™» 
151 HYPLVNVEGSKVG'EQY'KALNP INGKDGRREI aegwlsk kQ TGV^DLD'KHK S^49^W * ~n 

-tt V p r*vwvno«^^^^*^^^ ^ ^ P * N G K.D G R R E I A E G W L S K KIT GVNDLDKNK ^a^49~ cjblll^pep 
I« ^aJ-^S« EGSKV CEQ YK AL KP ? NGKDGRRE 1 AEGWLS KK ITGVND LD K N K S^Ti r^2 
1S1 N YP L VHVEGSX VCEQ Y K. A LHP I RGRDGXS E I AE GUSH I TG V N D L D K N K ^4q"^Tfi ™ 
151 N Y P k'^.N V E G'S K.V G E Q.Y KALNP I NGKDGRRE I A E G V L S K'K I TGVNSLDXN'K ^^49Z<cJblli;pep 

Y K I E I T V E G K T T V E T K E L N 0 P L D V V V L L P i N S N S M N N E R A N N S Q R A L K A G E Majority 
210 220 230 240 250

12} ^k! S^vp?"IL E TKELNQPUVVVLLDNSNSMNHERaInSQRAUAGE 3ag649 2603.pep 
201 YKIELTVEGKTTVETKELNQPLDVVVL LP KSKSMNNERANNSOPATRTACP o.Lq" m 
201 YK I ELTVEGKTTVETEELNQPLDYVVL'LDSSVSItHSEXAXRS 0 RALK ACE ^4^h,,T P 

^^I2^n"""5 TKELK ^^ 335649^16.^ 
201 YK I ELTVEGKTTVETKELNQPLDVVVLL-P* N'S NSKN'NE RAKKSQRALKAGE ^g^JblU^ 

AVEKLIDKIT SMKPNR VALyTYASTIFDGTEATVSKGVADONGKALKDSV Majority 

260 270 280 290 300

251AVEKLIDKITSN. K» M RV A L VT Y A S T I F D GTE A T VS K G VAD Q N G K A LN D S V sapfi49 ZfiOl o^o 
Zbl A.VEKl I D K I T S N K D.N R V- A t V T Y A S T I FPGTEAT VS K G V-A DONGKALN DSV S^R4Q r>iKiii -™ 

251. A V E .K L:I 0 K I TS N K D K R V A L VT Y AS T I F D G T E A T VS K G V A D b N CK A f H n ^ I-SSlS^ 
.2Sr A V E K LI D.K.l hit,, RV A L, T T > S T I G TE A I V S'{ t V A nIciIi* D I V ^S^^ 

. S W ,° - " K T T F l A T T B M Y S lik« L X lL" * »"g V N I L K S R I P K E A E H I M CD R T. L Y.O l^loritv.. 

310 320 330 340 350

SWBTHKTTFT. ATTHK Y S Y L B L T MD A H E V N I L K S R I P. K E A E IT I k G D R T L Y 0 sae649_2603 lio 

301 S W D Y H K T..T P. T A T T H M Y S Y L X L T H D A H E V N I L K S R 1 V K E A E H I M G D R T L Y 0 SS49"coh?'«o 

301. S W D.Y H K T T F T.A t .I It N Y S Y I H L T K.D A M E V H I L K S R I 'P K E A B H I N G D I t L X 6 SSlo^biir-L* 

301 SI 0YHKT.TFTATTBNYSYLJ1I.T. NHAKEVN. I LIS R IP K E A E II I NGD RTLYQ sae649~ 18rs21 "dcd 
301 SWDYHKTTFTATf HNY.S YLHL.THDAKEVN ILKSR IPK E'A III ICDSf 1TQ » 
301 S W D.Y H KT T E T A T T H * Y S Y L K L.T K b AH E-V N I L K. SRI P IIAEII1CDITLT JSKS^' 

F C A TP TQ K ALII fc A ME 1 L E T Q S S H A R K K L I F H V T D G V ft 1ST A IN F M P Y. I S lfalorttv 



360 370 380 390 400



351 FGATFTQKA L'M KANEILETQSSNARKKLIFHVTDGVPTMSYAINFNPYI S saa649 zetn „«, 

351 F G A T F.T QKALI1KANEI LETQSSNA R.K KL IFHVTDGVPTHSYA IK F N P Y I S 

^! pr^Tcxntr^^'"^*^^ ' LETQSSNARKKL I FHVTDGVPTUSYAI NFNPY I S sa^49~cjbll^pep 

351 FGAT.FTQKALUKANEI L.ETQSSNARKICLI FHVTDGVPTMSYAI NFNPY IS sa^O^JblU.^p 




WO 2006/078318 [...] method with PAM250 residue weight table. PCT/US2005/027239

T S Y 0 K 9 pgfe^pQ^ g Sj * pi^"t gUS D DY Q lVKGDCESFKLFSDRKy Malorltv - 

410 420 430 440 450

4^1 I 2 21* WSFLNKlPDRSCfLQEDFIINGiDYQ^KGDGESFKLFSDKKV sag649 2603.pep 

TS-ToSSfSIflSk pSp^I till** 1 INGDDY ^ « VKCDG.ESFKLFSDRKV^9jUS 
T^vnMn^M^cT^Mtr^ LQEDF I INGDDYQ IVKGDGESFKLFSDRKV sag649 nemSlG.oep 

TSYQNQFNSFLNK IPDRSG ILQEDFI INGDDYQ IVKGDGESFKLFSDRKV 3agWcJbm ^ 

.PVTCGTTQAAYRVPQNOLSyM SNEGYA I NSGY I YLYWRDYNWVYPFDPKT Major! ty 
460 470 480 490 500

PVTGGTTOA^YlfVPn^ NSGY- I YLYWRDYNWVYPFDPKT sag649^603.pe P 

-P V T G G T-T QAAYRVPQNQLSVHSNEGYA I NSGY I Y L Y W RD YNWVYPFDPKT sae649 oohl nln 
451 •PVTGGTTQAAYRVFQIfQLSVUSNE.GYA I MSG Y I YLYWRDYNWVYPFDPKT S^ifLo 
451 PVTGGTTQAAYRVPQ NQLSVMSNE G*Y AXNSGY I YLYWRDYNW VT P FDP K T 2^49^^ 
-451 PVTGGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLY WR D Y NW V Y P F D P K T SS^^G^n 
451 PVTGGTTQAAYRVPQNQLSVMSNEGYA IKS.G Y I YLYWRDYNWVYPFDPKT sa^49^cjb^ll ]pep 

K K V SATKQ I KTHGEPTTLYFNGNI RPK G YD I FT VG IGYNCBPGATPLEA E Majority 

510 520 530 540 550

501 KKVSATKQ I KTHGEPTTLYFNGNI RPKGYD I FTYGIGVNGDPGATPLEAE sae649 2603 oen 

2! KrvUTv'nirTwr^^ ' RPKGYD I FTVG IGVNGDPGATPLEAE* 2!^49~cohl !pep 

501 KKVSATKQ I K T.H GEPTTLYFN G'N I RPKGYD I FTYGIGVNGDPGATPLEAE sae649~clbll loeb 

SOI KKVSATKQ 1 KTHGEPTT-L-YFN G N I RPKGYD I FTVG IGV N G DPGATPLEAE saS49~I8rs21 d^p 

501 -KKVS-ATKQI K-THGEPTTLYFNGNI RPKGYD I FTVG -IGVNGDPGATPLEAE SS^^^s'Sd 

501.KKVSATKQ I KTHGEPTTLYFNGNI RPKGYD I F T V. G I G V N G D P G A I PL I A I SS^J 

KFMQS I SSKTENYTN VDDT NK I YDELNK YPKT I VEEKHS 1 VDCNVTD P HC Majority 

560 570 580 590 600

^2ncJoo^ ENYTNVDD ™ KI YDELNKY FKT I VEEKHS I VDGNVTDPMG sag649j2603.pep 
; SSK ^ EN YTN VD »TNK I YDE LN-KY FKT I VEEKHS I VDGN'VTD PUG ^oiohlvpep 
KFMQS I S SK- TEN YTN VDDTNK IYDELNKYFKT I VEEKHS I VDGNVTDPMG saR649~c! bill oeo 
KFMQSISSKTENYTNVDDTNK IYDELNKYFKT IVEEKHS IVPGNVTDPMG SS^lr^ 1"S 
551 K F M Q S IS.S-KTENYTNVDDTNK IYDELNKYFKT I VEEKHS I VDGNVTDPMG SSl^G "™ 
551 KFMQSISSKTENYTNVDDTNK I YDE LNKYFKT"! VEEKHS IVPGNVTDPMG sa^49l]cJWll [pep 

E M I EFQ L KN G Q S FTH P D Y V L V G N D G S Q L K N G V A L G GPN S D G G I L K D VT VT Majority 

610 620 630 640 650

601 EMIEFQLKNGQSFTHDDYVLVGNDGSQ L K NGVALGGPNSDGGILKDVTVT 3a*649 2603 pen 

^SSI^^5«5? SFTHDI)YVLVGNI)GSQ L K N C V A L G G P N S D G G I L K D V T V T sa|e49 cJblllTL, 
^! i E f |j!jk-KN -, GQSFTHDDYVLyGNDGSQLRNGVALGGPNSDG.G-ILKDVTVT sa^49~l8rs21.pep 

cni c u x E ^!^?'^^- G ^^. E ^^^. D ^^L^ G N® G SQLKNGVALGGPNSDGG I LKDVTVT 3^49^316^0 
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